wo 2005/049814 PCT/NL2004/000805 

1/87 



Fig. 1 



A6ATAGAGAATTTTCTTATTTAGACTTTGTGTCTACTCCTCTCAACTAAACGAAATTTTTCTAGT6CTGTCATTTGTTAT66CAGTCCTAGT 

» I ■ ■ ■ ■ I I ■ . , ■ I ■ ■ , . I I , . - I ■ . . . ■ I I ■ ■ , I 92 

TCTATCTCTTAAAAGAATAAATCTGAAACACAGATGAGGAGAGTT(3ATTtGCTTTAAAAAGATCACGACA6TAAACAATACC6TCAGGATCA 



GTAATTGAAATTTCGTCAAGTTTGTAAACTGGTTAGGCAAGTGTTGTATTTTCTGTGTTTAAGCACTGGTGGTTCTGTCCACTAGTGCACAC 

. 1 I I I I ■ ■ . ■ I ■ ■ — . I I i I ■ . . ■ 184 

CATTAACTTTAAAGCAGTTCAAACATTTGACCAATCCGTTCACAACATAAAAGACACAAATTCGTGACCACCAAGACAGGTGATCACGTGTG 



5'UTR 



ATTGATACTTAAGTG6TGTTCTGTCACTGCTTATTGTGGAA6CAACGTTCTGTCGTTGTGGAAACCAATAACTGCTAACCATGTTTTACAAT 

I I I ■ ■ i - l ■ ■ I ■ ■ I ■ I I I I I : ■ .1 ■ : I C 276 

TAACTATGAATTCACCACAAGACAGTGACGAATAACACCTTCGTT6CAAGACAGCAACACCTTTGGTTATT6ACGATTGGTACAAAAT6TTA 

— ^^"i^— 5*UTR^— — — i^^"—— — i-HIL M F Y N 

^Repijcase 1a— 

CAAGTGACACTTGCTGTTGCAAGTGATTCGGAAATTTCAGGTTTTGGTTTTGCCATTCCTTCTGTAGCCGTTCGCGCTTATAGCGAAGCCGC 

■ ' ■ I I I ■ ■ ■ ■ I ■ ■ I ■ ■ ■ ■ I ■ ' ■ ' I ' i 1 I I , ■ ■ ■ I I ■ ■ 368 

GTTCACTGTGAACGACAACGTTGACTAAGCCTTTAAAGTCCAAAACCAAAACGGTAAGGAAGACATCGGCAAGCGCGAATATCGCTTCGGCG 

GVTLAVASOSE I SGFGFAI PSVAVRAYSEAA 
Replicase la ' 



TGCACAAGGTTTTCAG6CAT6CC6CTTTGTTGCTTTTG6CTTACAG6ATT6TGTAACC6GTATTAATGAT6ACGATTATGTCATTGCATTGA 

i I I ■ ■ ■ . I I I : ■ I I r ■ ■ ■ ■ I ' ■ ■ I I ■ ■ ■ I ' I I 460 

ACGTGTTCCAAAAGTCCGTACGGCGAAACAACGAAAACCGAATGTCCTAACACATTGGCCATAATTACTACTGCTAATACAGTAACGTAACT 

AQGFOACRFVA'FGLOOCVT.G I NODDYV I AL 

Replicase la 



CTGGTACTAATCAGCTTTGTGCCAAAATTTTACTTTTTTCTGATA6ACCTCTTAATTTGCGA6GTTGGCTCATTTTTTCTAACAGCAATTAT 

■ ■ ' " ' - I i I . ■ ■ ■ I t : . I I I I ■ ■ I | - . ■ ■ I i ■ I 652 

.GACCATGATTAGTCGAAACACGGTTTTAAAATGAAAAAAGACTATCTGGAGAATTAAACGCTCCAACCGAGTAAAAAAGATTGTCGTTAATA 

TGTN. GLCAK ILLFSDRPLNLRG WLIFSNSNY 

Replicase la 

GTTCTTCAGGACTTTGATGTTGTTTTTGGCCATGGTGCAGGAAGTGTGGTTTTTGTGGATAAGTATATGTGTGGTtTTGATGGTAAACCTGT 

I I 1 I I ' ■ I I I . ■ > ■ I 1 1 — ^ 644 

CAAGAAGTCCTGAAACTACAACAAAAACCGGTACCACGTCCTTCACACCAAAAACACCTATTCATATACACACCAAAACTACCATTTGGACA 



VLQDFDVVFGHGAGSVVFVDKYMCGFDGKPV 
. Replicase 1 a • 

'6TTACCTAAAAACATGTGGGAATTTAGAGATTACTTTAATGATAATACTGATAGTATTGTTATTGGTGGTGTCACTTATCAATTA6CATG6G 

I ■ I ■ ■ ' » i ■ ' I ' I ' ■ ' I ' ■ ■ I I I I ■ ■ ■ I ■ ■ ■ I I ■ ■ ■ I I I . t il I ■ 736 

CAATGGATTTTTGTACACCCTTAAATCTCTAATGAAATTACTATTATGACTATCATAACAATAACCACCACAGTGAATAGTTAATCGTACCC 



LPKNMWEFRDYFNONTOS I V I GGVTYQLAW 
Replicase la = 

ATGTTATACGTAAAGACCTTTCTTATGAACAGCAAAATGTTTTAGCTATTGAGAGCATTCATTATCTTGGCACTACAGGTCATACTTTGAAG 

' I ' I ' I I I . ■ ■ . . I ■ I ■ I 1 I ■ ■ ■ 1 I 828 

TACAATAT6CATTTCTGGAAAGAATACTTGTC6TTTTACAAAATCGATAACTCTCGTAAGTAATAGAACCGTGATGTCCAGTATGAAACTTC 

DV I.RKDLSY EOONVLAI ES I HYLGTTGH TLK 
Replicase 1a 
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TCT6GTTGCAAACTCATTAATGCCAAGCCGCCTAAATATTCTTCTAAGGTTGTTTTGAGTGGTGAATGGAATGCTGTGTATAAGGCGTTTGG 
■ I ■ ■ ■ ■ ' ' ■ ' » ' ' I ■ ■ ■ ■ I I I ■ . ■ ■ I I ■ I • i i I I I I , , , , I , , , ■ I I 

AGACCAACGTTTGAGTAATTACGGTTCGGCGGATTTATAAGAAGATTCCAACAAAACTCACCACTTACCTTACGACACATATTCCGCAAACC 

SGCKL 1 NAKPPKYSSKVVLSGEWNAVYKAFG 
■ Replicase 1 a 



TTCACCATTTATTACAAATGGTATATCATTGCTAGATATAATTGTTAAACCAGTTTTCTTTAATGCTTTTGTTAAATGCAATTGTG6TTCTG 

I I I ■ ■ I ' ■ ■ ■ I I I r > * ■ ■ I I ■ ■ 1012 

AAGTGGTAAATAATGTTTACCATATAGTAACGATCTATATTAACAATTT6GTCAAAAGAAATTACGAAAACAATTTACGTTAACACCAAGAC 

SPFITNGISLLDI I. VKPVFFNAFVKCNCGS 
^— —— Replicase 1 a , 



AGAATTGGAGT6TTGGTGCATGGGATGGTTATCTATCTTCTTGTTGTGGCACACCTGCTAAGAAACTTTGT6TTGTTCCTGGTAATGTTGTT 

I ■ ■ ■ I I . ■ I I I I I ■ ■ ■ ■ I ■ ■ ■ . 1104 

TCTTAACCTCACAACCACGTACCCTACCAATAGATAGAAGAACAACACCGTGTGGACGATTCTTTGAAACACAACAAGGACCATTACAACAA 

ENWSVGAWDGYLSSCCGTPAKKLCVVPGNVV 

" Replicase la 



CCTGGTGATGTGATCATCACCTCAACTGATGCTGGTTGTGGTGTTAAATACTATGCTGGCTTAGTTGTTAAACATATfACTAACATTACTGG 

I I 1 I I I I i I ■ ■ ■ ■ I ■ 1196 

GGACCACTACACTAGTAGTGGAGTTGACTACGACCAACACCACAATTTATGATACGACCGAATCAACAATTTGTATAATGATTGTAATGACC 

PGDVI ITS TDA&CG'VKYYAGLVVKHITNITG 

Replicase la 



TGTGTCTTTATGGCGTGTTACAGCTGTTCATTCTGAT6GAAT6TTTGTGGCAACATCTTCTTATGATGCACTTTTGCATAGAAATTCATTAG 

■ ' > I H 1 1 i ' ■ ' I r« 1 1 ' I ' ■ . i .-l I I 1 1288 

acacagaaataccgcacaatgtcgacaagtaagactaccttacaaacaccgttgtagaagaat'actacgtgaaaacgtatctttaagtaatc 

vslwrvtavhsogmf.vatss.,ydallh;rnsl 

Replicase 1 a ■ ' 



accctttttgctttgatgttaacactttactttctaatcaattacgtctagcttttcttggtgcttctgttacagaagatgttaaatttgct 

■ i I I I I I I I . . ■ ■ I , ■ , ■ I I 1380 

TGGGAAAAACGAAACTACAATTGTGAAATGAAAGATTAGTTAATGCAGATCGAAAAGAACpACGAAGACAATGTCTTCTACAATTTAAACGA 

dpfcfovntllsnqlrlaflgasvtedvkfa 
• Replicase la 



gctagcactggtgttattgacattagtgctggtatgtttggtctttacgatgacatattgacaaacaataaaccttggtttgtacgcaaagc 

I I I I .... I .... i I I I ■ ■ , . I ■ ■ 1472 

cgatcgtgaccacaataactgtaatcacgaccatacaaaccagaaatgctactgtataactgtttgttatttggaaccaaacatgcgtttcg 

astgvio i sagmfglyooittnnkpwfvrka 

Replicase 1 a . — . 



TTCTGGGCTTTTTGATGCAATCTGGGATGCTTTTGTTGCCGCTATTAAGCTtGTGCCAACTACTACTGGTGGTTTGGTTAGGTTTGTTAAGT 

^ 'I I I .... I 1 1 I I I ■ I 1664 

aagacccgaaaaactacgttagaccctacgaaaacaacggcgataattcgaacacggttgatgatgaccaccaaaccaatccaaacaattca 

sglfoaiwoafvaaiklvptttgglvrfvk 

Replicase 1a 



ctatcgcttcaactgttttaactgtttctaatggtgttattattatgtgtgcagatgttccagatgcttttcaaccagtttaccgcacattt 

i ■ ' i i ■ i i ■ i i i i i ■ » 1656 

gatagcgaagttgacaaaattgacaaagattaccacaataataatacacacgtctacaaggtctacgaaaagttggtcaaatggcgtgtaaa 

siastvltvsn6vi i mcadvpoafopvyrtf 

Replicase la 
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ACACAAGCTATTTGTGCTGCATTTGATTTTTCTTTAGATGTATTTAAAATTG6TGATGTTAAATTTAAACGACTTGGTGATTATGTTCTTAC 

■ " » I I ■ ■ , I , , . , I ■ I 1 ^ I I . , ■ ■ I ■ , , . I I , J ^y^Q 

TGTGTTC6ATAAACACGACGTAAACTAAAAAGAAATCTACATAAATTTTAACCACTACAATTTAAATTTGCTGAACCACTAATACAAGAATG 

TQAICAAFOFSLDVFK I GOVKFKRLGOYVLT 

RepHcase 1a^ — 



TGAAAATGCTCTTGTTCGTTTGACTACTGAA6TTGTTCGTGGTGTTCGTGATGCTCGCATAAAGAAAGCCATGTTTACTAAAGTA6TTGTA6 

' t ' ' I I ' ' ' I ■ ' ' ' I ■ ' ' ' ' ' ' I ' » 1 ■ ■ ■ ■ I ■ ■ ■ ' I I 1840 

ACTTTTACGAGAACAAGCAAACTGATGACTTCAACAAGCACCACAAGCACTACGAGCGTATTTCTTTCGGTACAAATGATTTCATCAACATC 

ENALVRLTTEVVRGVROAR I KKvAMFTKVVV 

Replicase 1a 



6TCCTACAACTGAAGTTAAGTTTTCTGTTATTGAACTTGCCACTGTTAATTTGCGTCTTGTTGATTGTGCACCTGTA6TTT6CCCTAAAG6T 

I ' ' ' I ■ ■ ■ I I i ■ ■ I I I I t ■ ' 1932 

CAGGATGTTGACTTCAATTCAAAAGACAATAACTT6AAC6GTGACAATTAAACGCAGAACAACTAACAC6T6GACATCAAACG6GATTTCCA 

GPTTEVKFSV I ELATVNLRLVDCAPVVCP KG 

• Replicase 1 a 



AAAATTGTTGTTATTGCTGGACAAGCTTTTTTCTATA6TGGT6GTTTTTATCGTTTTATGGTTGATTCTACAACTGTATTAAATGACCCTGT 

I I I I I I I I ■ ■ ■ I ■ I . . ■ ■ ' 1 ■ ■ ■ ■ 2024 

TTTTAACAACAATAAC6ACCT6TTCGAAAAAAGATATCACCACCAAAAATAGCAAAATACCAACTAAGATGTTGACATAATTTACTGGGACA 

KIVVIAGQAFFYSGGFYRFMVDSTTVLNOPV 
' Replicase 1 a — ^— ^— — ^— 



TTTTACT6GTGAGTTATTTTATACTATTAAGTTTAGTG6TTTTAAG£TTGATGGTTTTAACCATCAGTTTGTTAATGCTAGTTCTGCTACAG 

I I ■■■■ > ■■■. I I ■ > I . I ■ , I , • I r I I ■ I . ■ ■ ■ ■ I 21 16 

AAAATGACCACTCAATAAAATATGATAATTCAAATCACCAAAATTCGAACTACCAAAATTGGTAGTCAAACAATTACGATCAAGACGAT6TC 

FTGELFYT I KFSGFK.LDGFNH. QFVNASSAT 
^— — — Replicase 1 a ■ ^ 



ATGCCATTATTGCTGTTGAGCTGTTGTTATCGGATTTTAAAACTGCAGTTTTTGTGTACACATGTGTGGTTGATGGTTGTAGTGTCATTGTT 

-^^H 1 : ■ I I I I II I 1 ' ■ ■ ■ I I 1 1 ' ' I I I I I I ■■ I 2208 

TACGGTAATAACGACAACTCGACAACAATAGCCTAAAATTTT6ACGTCAAAAACACATGTGTACACACCAACTACCAACATCACAGTAACAA 

DAI lAVELLLSDFKTAVFVYTCVVDGCSVl 'v 

Replicase la 



AGACGTGATGCTACATTCGCCACACATGTGTGTTTTAAGGACTGTTATAGTATTTGGGAGCAATTCTGCATTGATAATTGTGGTGAGCCATG 

■ I I I 1 I I I 1 1 I 2300 

TCTGCACTACGATGTAAGCGGTGTGTACACACAAAATTCCTGACAATATCATAAACCCTCGTTAAGACGTAACTATTAACACCACTCG6TAC 

RROATFATHVCFKDCYSIWEOFC I DNC6EPW 
Replicase 1 a : 



GTTTTTGACTGATTATAATGCTATCTTGCAGAGTAATAACCCTCAATGTGCTATTGTTCAAGCATCGGAGTCTAAAGTTTT6CTTGAGAGGT 

' ' i I i I ■ I ■ ■ ■ . I I ■ ■ ■ I i I I I I ■ I ■ . 2392 

CAAAAACTGACTAATATTACGATAGAACGTCTCATTATTGGGAGTTACACGATAACAAGTTCGTAGCCTCAGATTTCAAAAC6AACTCTCCA 

FLTOYNAILQSNNPQCAIVOASESKVLLER 

Replicase la 



TTTTACCTAAGTGTCCTGAAATACTGTTGAGTATTGATGATGGCCATTTATGGAATCTTTTTGTTGAAAAGTTTAATTTTGTTACAGATTGG 

' ■ I 1 I ■ . ■ ■ I I I ■ ■ I I I , I I . I I I I I I i I I 2484 

AAAATGGATTCACAGGACTTTATGACAACTCATAACTACTACCGGTAAATACCTTAGAAAAACAACTTTTCAAATTAAAACAAT6TCTAACC 



FLPKCPEILLSIODGHLWNLFVEKFNFVTDW 

Replicase la ■ — 
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TTAAAAACTCTTAAGCTTACACTTACTTCTAATGGTCTTTTAGGTAATTGTGCCAAACGTTTTAGACGTGTTTTG6TAAAATTGCTTGATGT 

^ -*4-- I • ^ \ I t I i ' I I I I III 2576 

AATTTTTGAGAATTCGAATGTGAATGAAGATTACCAGAAAATCCATTAACACGGTTTGCAAAATCTGCACAAAAGCATTTTAACGAACTACA 

LKTLKLTLTSNGLLGNCAKRFRRVLVKLLOV 
'• Replicase la — — 



CTATAATG6TTTTCTTGAAACTGTCTGTAGTGTCGTACACACTGCTGGTGTTTGCATTAAATATTATGCTGTTAATGTTCCATATGTAGTTA 

■ I 1 I I I ■ ■ ■ I I I I I ■ ■ ■ I I I , , I 2668 

GATATTACCAAAAGAACTTTGACAGACATCACAGCATGTGTGACGACCACAAACGTAATTTATAATACGACAATTACAAG6TATACATCAAT 

YN.6FLETVCSVVHTAGVC I KYYvAVNVPYVV 
— — ^ -Replicase la 



TTAGTGGTTTTGTAAGTCGTGTAATTCGTA6A6AAAG6T6TGAC6TGACTTTTCCTTCTGTTAGTT6TGTCACTTTTTTCTATGAATTTTTA 

■I I ' I I 1 ■ I 1 1 I ■ t - I ■ I h 2760 

AATCACCAAAACATTCAGCACATTAAGCATCtCTTTCCACACTGCACTGAAAAGGAACACAATCAACACAGTGAAAAAAGATACTTAAAAAT 

ISGFVSRVIRRERCDVTFPCVSCVTFFYEFL 
Replicase 1a 



GACACGTGTTTTGGTGTTAGTAAACCTAATGCCATTGATGTTGAACATTTAGAGCTTAAAGAAACTGTTTTTGTTGAACCTAAGGATGGTGG 

1 , 1 I .... t ... I i I ■ ■ r . ■ , ■ i ■ ■ ■ ■ I ■ ■ . . I I I . ■ 2852 

CTGT6CACAAAACCACAATCATTTGGATTACGGTAACTACAACTTGTAAATCTCGAATTTCTTTGACAAAAACAACTTGGATTCCTACCACC 

OTCFGVSKPNA I OVEHLELKET .VFVEPKOGG 

Replicase 1 a — — — — 



TCAATTTTTTGTTTCT6ATGATTATCTTTGGTATGTTGTAGATGACATTTATTATCCAGCTTCATGTAATG6TGTATTGCCAGTTGCTTTTA 

I I \ i n ' ■■ I i - ' ' I i I . 1^1 ■ ■ ■ I ■ ■ : I ■ ■ 2944 

AGTTAAAAAACAAAGACTACTAATAGAAACCATACAACATCTACTGTAAATAATAGGTCGAAGTACATTACCACATAACGGTCAACGAAAAT 

OFFVSDOYLWYVV DDI YYPASCN6VLPVAF 
— — — — ^ — Replicase 1a s : — 



CAAAATTGGCAGGTG6TAAAATATCTTTTTCTGATGATGTTATAGTTCATGATGTTGAACCTACCCATAAAGTCAAGCTCATATTT6AGTTT 

I I ■ ■ , ■ I , . ■ . I I I I , ■ ■ ■ I . . ■ . I I I .... I ■ 3036 

6TTTTAACCGTCCACCATTTTATAGAAAAA6ACTACTACAATATCAA6TACTACAACTT6GATGG6TATTTCAGTTCGAGTATAAACTCAAA 

TKLAGGK ISFSODVIVHOVEPTHKVKLIFEF 
— — — ^— — ^— ^— Replicase 1 a 



GAAGATGATGTTGTTACCAGTCTTTGTAAGAAGAGTTTTGGTAAGTCTATTATTTATACAGGTGATTGGGAAGGTTTACATGAAGTTCTTAC 

■ " I I I I ■ , I ■ I , I I 1 ■ ■ " 1 " ■ ■ I I I . . . . I . . > 3128 

CTTCTACTACAACAATGGTCAGAAACATTCTTCTCAAAACCATTCAGATAATAAATATGTCCACTAACCCTTCCAAATGTACTTCAAGAATG 

ED0VVTSLCKKSF6KSI IYTGDWE6LHEVLT 

Replicase la . 



ATCTGCAATGAATGTCATTGGGCAACATATTAAGTTGCCACAATTTTATATTTATGAT6AAGAG6GTGGTTATGATGTTTCTAAACCAGTTA 

■ ■ I ■ ■ I ■ ■ I ■ I : I i ■ ■ ■ i I I I I ■ ■ ■ I I , ■ ■ ■ I I 3220 

TAGAC6TTACTTACAGTAACCC6TTGTATAATTCAAC6GTGTTAAAATATAAATACTACTTCTCCCACCAATACTACAAAGATTTGGTCAAT 

SAMNVIGQHIKLPOFYIYDEEGGYOVSKPV 
— Replicase 1 a 



T6ATTTCACAATGGCCTATTAGTGATGATAGTGAT6GTTGTGTTGTTGAAGCGAGCACT6ATTTTCATCAATTAGAATCT6TTAGAGAAGAG 

I ■ I ■ I ■ ■ ■ I I I ' I I I I I 1 / ■ 3312 

ACTAAAGTGTTACCGGATAATCACTACTATCACTACCAACACAACAACTTCGCTCGTGACTAAAA6TA6TTAATCTTA6ACAATCTCTTCTC 

Ml SQWP I S00SD 6CVVEAST0FHQLESVREE 

Replicase la 
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GTTGATATAATTGAACAACCTTTTGGGGAAGTTGAACATGCGCTCTCAATTAGACAACCTTTTTCTTTTTCTTTTAGAGAJGAATTGGGTGT 

— ' ' ■ ' * I I ' ' ' I I I I I ■ . . . 31104 

CAACTATATTAACTT6TTGGAAAACCCCTTCAACTTGTACGCGAGAGTTAATCTGTTG6AAAAAGAAAAAGAAAATCTCTACTTAACCCACA 

VDI lEQPFGEVEHALSI RQPFSFSFROEL GV 
[ — rReplicase la 



TC6TGTTTTAGATCAATCTGATAATAATTGTTGGATTAGTACCACACTTATACAGTTGCAACTTACAAAGCTTTTGGATGATTCTATT6AGA 

1 1 ' I ' • 1 I 1 1 i 1 1 »i 1 ' ■ ' i ' ' . . I ■ 3^*96 

AGCACAAAATCTAGTTAGACTATTATTAACAACCTAATCATGGTGTGAATATGTCAACGTTGAATGTTTCGAAAACCTACTAAGATAACTCT 

RVLDQSDNNCWISTTL I QLQLT, KLLODS I E 

• Replicase 1 a — — — — — — ^— — — 



T6CAATTGTTTAAAGTTGGTAAAGTTGATTCAATTGTTCAAAAGTGTTATGAGTT6TCTCATTTAATTAGTG6TTCACTTGGTGATAGT6GT 

' ■ ■ i I I I ' > ■ i I ■ ■ ■ I I ■ . ■ ■ I . 1 I 3588 

ACGTTAACAAATTTCAACCATTTCAACTAAGTTAACAAGTTTTCACAATACTCAACAGAGTAAATTAATCACCAAGT6AACCACTATCACCA 

MQLFKVGKVOSIVOKCYELSHLISGSLGOSG 

Replicase la 



AAACTTCTTAGTGAACTTCTTAAAGATAAATATACATGTTCTATAACTTTT6AGATGTCTT6TGATTGTGGTAAAAAGTTTGATGA6CAAGT 

■ I I I i I I I I I 3680 

TTTGAAGAATCACTTGAAGAATTTCTATTTATATGTACAAGATATTGAAAACTCTACAGAACACTAACACCATTTTTCAAACTACTC6TTCA 

KLLSELLKDKYTCSITF EMSCDCGKKFOEQV 

Replicase la 



TGGTTGTTTGTTTTG6ATTATGCCTTACACAAAACTTTTTCAAAAAGGTGAGTGTTGTATTTGTCATAAAATGCAGACTTATAAGCTTGTTA 

' ■ ' ' I I ■ I , ■ t ■ I , I I 1 I < — i — I 1 'I I 1 I I . I ■ ■ I « — H 3772 

ACCAACAAACAAAACCTAATACGGAATGTGTTTTGAAAAAGTTTTTCCACTCACAACATAAACAGTATTTTACGTCTGAATATTCGAACAAT 

GCUFWIMPYT KLFOKGECCICHKHOTYKLV 

Replicase 1a . . 



6TATGAAAGGTACTGGTGTGTTTGTACAGGATCCAGCACCTATTGACATTGATGCTTTCCCTGTTAGACCTATATGTTCATCTGTATATTTA 

I I I ' ■ I I I I I I ■ I I I ■ ■ I I I ■ ■ . ■ 3864 

CATACTTTCCATGACCACACAAACATGTCCTAGGTCGTGGATAACTGTAACTACGAAAGGGACAATCTGGATATACAAGTAGACATATAAAT 

SMKGTGVFVOOPA.P I D I OAF PVR P I C SSVYL 

Replicase 1a 



G6TGTTAAGGGTTCTGGTCATTATCAAACAAATTTATACAGTTTTGACAAAGCTATT6ATGGTTTTGGTGTCTTTGACATTAAAAATA6TAG 

I I I I I ■ ■ ■ ■ I 1 I I I 3966 

CCACAATTCCCAA6ACCAGTAATAGTTTGTTTAAATATGTCAAAACT(5TTTCGATAACTACCAAAACCACAGAAACTGTAATTTTTATCATC 

GVKGSGHYQTNLYSFOKAIDGFGVFDIKNSS 

Replicase la 



TGTTAATACTGTTTGTTTTGTTGATGTTGATTTTCATAGTGTAGAAATAGAAGCTGGTGAAGTTAAACCTTTTGCTGTATATAAAAATGTTA 

I i 1 1 I 1 1 1 ■ I ■ I I ■ ■ 4048 

ACAATTATGACAAACAAAACAACTACAACTAAAAGTATCACATCTTTATCTTC6ACCACTTCAATTTGGAAAACGACATATATTTTTACAAT 

VNTVCFVDVDFHSVE lE AGEVKPFAVYKNV 

— Replicase 1 a ^— 



AATTTTATTTAGGTGATATTTCACACCTTGTAAACTGTGTTTCTTTTGACTTTGTTGTCAATGCTGCTAATGAAAATCTCATGCATGGAGGC 

I ' I ■ ■ I I ' ■ i I . . .. I ... I I I I I , I 4140 

TTAAAATAAATCCACTATAAAGT6TGGAACATTTGACACAAAGAAAACTGAAACAACAGTTACGACGATTACTTTTAGAGTACGTACCTCCG 

K FYLGO I SHLVNCVSFOFVVNAANENLMHGG 
— — ^ — —Replicase la 
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GGTGTCGCACGTGCTATTGATATTTTGACTGAAGGTCAACTTCAGTCATTATCTAAAGATT:ACATTAGTAGTAATGGTCCACTTAAGGTTGG 

I I I ■ ■ ■ i I I I ' ■ I I ■ I I I I T I , I 4232 

CCACAGCGTGCACGATAACTATAAAACTGACTTCCAGTTGAAGTCAGTAATAGATTTCTAAT6TAATCATCATTACCA6GTGAATTCCAACC 

GVA. RA I 0 1 LTEGOLQSLSKDY I SSNGPLK VG 
Replicase 1 a ■ 



AGCAGGTGTTATGTTGGAGTGTGAAAAATTCAATGTATTTAATGTTGTTGGTCCGCGAACTGGTAAACATGAGCATTCATTACTTGTTGAAG 

1 ■ ■ ■ ■ I 1 | - I ■ ■ I I ■ ■ ■ ■ I — ' ■ I ■ ■ ■ ■ I ■ ■ ' ■ I ■ ■ ■ ■ i\32^ 

TCGTCCACAATACAACCTCACACTTTTTAAGTTACATAAATTACAACAACCAGGCGCTTGACCATTT6TACTCGTAAGTAATGAACAACTTC 

AGVMLECEKFNVFNVVGPRTGKHEHSLLVE 
— — ^ Replicase la 



CTTATAATTCTATTTTATTTGAAAATGGTATTCCACTTATGCCTCTTCTTAGTTGTGGTATTTTTGGTGTAAGGATTGAAAATTCTCTTAAA 

I I I . . I I ■ I i I 1 - I I I ' ■ ' I I 4^16 

6AATATTAAGATAAAATAAACTTTTACCATAA6GTGAATACG6AGAAGAATCAACACCATAAAAACCACATTCCTAACTTTTAAGA6AATTT 

AYNSILFENGIPLMPLLSCG- IFGVRIENSLK 

Replicase la 



GCTTTGTTTAGTTGTGACATTAATAAACCATTGCAAGTTTTTGTTTATTCTTCAAATGAAGAACAAGCTGTTCTTAAGTTTTTAGAT66TTT 

1 ■ ■ ■ ■ I ■ 1 ' ■ I i I ■ • I r ■ ■ ■ ■ I ■ ■ ' ■ I 4508 

CGAAACAAATCAACACTGTAATTATTTGGTAACGTTCAAAAACAAATAAGAAGTTTACTTCTTGTTCGACAA6AATTCAAAAATCTACCAAA 

ALFSCD I NKPLQVFVYSSNEEQAVLKFLOGL 
Replicase 1 a : 



AGATTTAACACCAGTCATTGACGATGTTGATGTTGTTAAACCTTTTAGAGTTGAAGGTAATTTTTCATTCTTTGATTGTGGTGTCAATGCCT 

^ 1 i 1 ■ ■ ■ ■ I 1 ^ H 1 H — — ? I- ■ I • h I '♦eoo 

TCTAAATTGTG6TCAGTAACTGCTACAACTACAACAATTTGGAAAATCTCAACTTCCATTAAAAAGTAAGAAACTAACACCACAGTTACGGA 

OLTPV I DOVDVVKPFRVEGNF SFFOCGVNA- 
— — ! — ^ Replicase 1a ' , r 



TGGATGGTGATATTTACTTATTATTTACTAACTCTATTTTAATGTTGGATAAACAAGGACAATTATTGGACACAAAACTTAATGGTATTTTG 

I ■ ' I 1 * ■>■> ! L I ' I I I I I ■ ■ 4692 

ACCTACCACTATAAAT6AATAATAAATGATTGAGATAAAATTACAACCTATTTGTTCCTGTTAATAACCT6TGTTTTGAATTACCATAAAAC 

LOGDIYLLFTNSILMLDKGGO LLDTKLNGIL 

Replicase la ' 



CAACAGGCAGTTCTTGATTATCTTGCTACAGTTAAAACTGTACCA6CTGGTAATTT6GTTAAACTTGTT6TTGAGAGTTGTACCATTTATAT 

I I I ■ ■ I t I I . ■ ■ ■ I I I i ■ ■ I 1— H 1 1 4784 

GTTGTCCGTCAA6AACTAATAGAACGAT6TCAATTTTGACATGGTCGACCATTAAACCAATTTGAACAACAACTCTCAACATGGTAAATATA 

QOAVLDYLATVKTVPAGNLVKLVVESCT I YM 
— f- Replicase 1 a . ' 



GTGTGTTGTACCATCGATAAATGATCTTTCTTTTGATAAAAATCTTGGTCGTTGTGTGCGTAAACTTAATAGATTGAAAACTTGTGTTATTG 

I ■ ■ I r ' I ■ ■ I I I I I ' ' ' I I 4876 

CACACAACATGGTAGCTATTTACTAGAAAGAAAACTATTTTTAGAACCAGCAACACACGCATTTGAATTATCTAACTTTT6AACACAATAAC 

CVVPS I NDLSFDKNLGRCVRKLNRLKTCV I 
— — — — Replicase 1 a 



CCAATGTTCCTGCTATTGATGTTTTGAAAAA6CTTCTTTCAA6TTTGACTTTAACT6TTAAATTTGTTGTAGAGA6TAATGTTATGGAT6TT 

I . . 1 -I 1 1 ■ ■ ■ ■ I i ' ■ I • I I ■ I I ■ I ■ I I I I 4968 

GGTTACAAGGACGATAACTACAAAACTTTTTCGAAGAAAGTTCAAACTGAAATTGACAATTTAAACAACATCTCTCATTACAATACCTACAA 

ANVPA I OVLKKLLSSLT LTVKFVVESNVMDV 

Replicase la . 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/00080S 

7/87 



AACGACTGTTTTAAGAATGATAATGTAGTTTTGAAAATTACTGAAGATGGTATTAATGTTAAAGATGTTGTTGTTGAGTCTTCTAAGTCACT 

' I 1 I I I I I . ' I I I 5060 

TTGCTGACAAAATTCTTACTATTACATCAAAACTTTTAATGACTTCTACCATAATTACAATTTCTACAACAACAACTCAGAAGATTCAGTGA 

NDCFKNONVVLK I TE DG I NVKOVVVESSK SL 

Replicase 1 a 



TGGTAAACAATTGGGTGTTGTGAGTGATGGTGTTGACTCTTtTGAAGGTGTTTTACCTATTAATACTGATACTGTCTTATCTGTAGCTCCAG 

■ ■ I ■ ■ I ' ■ ' ' I h-; 1 I I I I i ■ ■ 5152 

ACCATTTGTTAACCCACAACACTCACTACCACAACTGAGAAAACTTCCACAAAATGGATAATTATGACTATGACAGAATAGACATC6AGGTC 

GKOLGVVSDGVDSFEGVLP INTOTVLSVAP 

Replicase 1a 



AAGTTGACTGGGTTGCTTTTTACGGTTTTGAAAAGGCAGCACTTTTTGCTTCTTTGGATGTAAA6CCATATGGTTACCCTAATGATTTT6TT 

— I I I I I I I ' I ■ ■ I I 5244 

TTCAACTGACCCAACGAAAAATGCCAAAACTTTTCCGTCGTGAAAAACGAAGAAACCTACATTTCGGTATACCAATGGGATTACTAAAACAA 

EVOWVAFYGFEKAALFASLDVKPYGYPNDFV 

Replicase la 



GGTGGTTTTAGAGTTCTTGG6ACCACCGACAATAATTGTTG6GTTAATGCAACTTGTATAATTTTACAGTATCTTAAGCCTACTTTTAAATC 

H 1 1 • i I I i . I i 6336 

CCACCAAAATCTCAAGAACCCT66TGGCTGTTATTAACAACCCAATTACGTTGAACATATTAAAATGTCATAGAATTCG6ATGAAAATTTA6 

GGFRVLGTTDNNCWVNATCI ILOYLKPTFKS 

Replicase 1a 



TAAGGGTTTAAATGTTCTTTGGAACAAATTTGTTACAGGTGATGTTGGACCTTTTGTTAGTJTTATTTATTTTATAACTATGTCTTCAAAGG 

■ . ■ I 1 1 H 1 I ' ' ' I 1 I i ■ ■ ■ . n ■ . . . I 5428 

ATTCCCAAATTTACAAGAAACCTTGTTTAAACAATGTCCACTACAACCTGGAAAACAATCAAAATAAATAAAATATTGATACAGAAGTTTCC 

KGLNVLWNKFVTGDVGPFVSF I YF I TMSSK 

i Replicase 1 a '• r— r— — 



GTCAAAAGGGTGATGCTGAAGAGGCATTATCTAAATTGTCAGA6TATTTGATTAGTGATTCTATTGTTACTCTTGAACAATATTCAACTTGT 

■ I I I ■ ■ ■ I I ■ I I I 1 ■ ■ ■ I ' ■ ■ I I * ' ' ■ ' I 5520 

CA6TTTTCCCACTAC6ACTTCTCCGTAATAGATTTAACAGTCTCATAAACTAATCACTAA6ATAACAATGA6AACTTGTTATAAGTTGAACA 

GOKGOAEEALSKLSEYLISDSIVTL EQYSTC 
———— —————— Replicase 1 a -— — — — — — 



6ACATTTGTAAAAGTACTGTAGTT6AA6TTAAAAGTGCT6TTGTCTGTGCTA6TGTGCTTAAAGATGGTTGT6AT6TTGGTTTTTGTCCACA 

I : : ■ I 1 I I ■ ' I I ■ I ' ■ ' ' I ^ 5612 

CTGTAAACATTTTCATGACATCAACTTCAATTTTCAC6ACAACA6ACACGATCACACGAATTTCTACCAACACTACAACCAAAAACAGGTGT 

OICKSTV VEVKSAVVCASVLKOGCDVGFCPH 

Replicase 1a 



CAGACATAAATTGC6TTCACGTGTTAAGTTTGTTAATGGACGTGTTGTTATTACCAAT6TTGGT6AACCTATAATTTCACAACCTTCTAAGT 

I . . I ■ ■ I I I I ■ ' ■ I I ■ ■ ■ ■ I ■ ■ I I I I ' 5704 

GTCTGTATTTAACGCAAGTGCACAATTCAAACAATTACCTGCACAACAATAAT6GTTACAACCACTTGGATATTAAAGTGTTGGAA6ATTCA 

RHKLRSRVKFVNGRVVITNVGEPI ISQPSK 

Replicase la 



T6CTTAATGGTATTGCTTATACAACATTTTCAGGTTCTTTTGATAACGGTCACTATGTAGTTTATGATGCTGCTAATAAT6CTGTCTATGAT 

■ > ■ I I I I I ■ I I I I .... I I ■ I I II 5796 

ACGAATTACCATAACGAATATGTT6TAAAAGTCCAAGAAAACTATTGCCA6TGATACATCAAATACTAC6ACGATTATTAC6ACA6ATACTA 

LLNGIAYTTFSGSFDNGHYVVYDAANNAV Y D 
' • Replicase 1 a . 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

8/87 



6GTGCTCGTTTATTTGCTTCAGATTTGTCTACTTTAGCTGTTACAGCTATTGTTGTAGTAGGTGGTTGTGTAACATCTAATGTTCCACCAAT 

I I I 1 "I I ' ' - ■ ' 1 . I ■ ' I ■ I ■ : ■ I 5888 

CCACGAGCAAATAAAC6AAGTCTAAACAGATGAAATCGACAATGTCGATAACAACATCATCCACCAACACATTGTAGATTACAAGGT6GTTA 

GARLFASDLSTLAVT A I VVVGGCVT SNVP P I 
— — Replicase la ' . 



TGTTAGTGAGAAAATTTCTGTTATGGATAAACTTGATACTGGTGCACAAAAATTTTTCCAATTTGGTGATTTTGTTATGAATAACATTGTTC 

■ 1 1 i ■ 1 1 \ 1 I » ■ ' I I 1 1 I 5980 

ACAATCACTCTTTTAAAGACAATACCTATTTGAACTATGACCACGTGTTTTTAAAAAGGTTAAACCACTAAAACAATACTTATT6TAACAAG 

VSEK I SVMDKL DTGAQKFFQFGDFV M NiN I V 

Replicase la 



TGTTTTTAACTTG6TTGCTTAGTATGTTTAGTCTTTTACGTACTTCTATTAT6AAGCAT6ATATTAAAGTTATTGCCAA6GCTCCTAAAC6T 

I I . ■ . ■ i ■ ■ ■ ■ I I ■ I I I I I . I 6072 

ACAAAAATTGAACCAACGAATCATACAAATCAGAAAATGCATGAAGATAATACTTCGTACTATAATTTCAATAACGGTTCCGAGGATTTGCA 

LFLTWLLSMFSLLRTSIMKHDIKVIAKAPKR 

• Replicase 1 a _ 



ACAGGTGTTATTTTGACACGTAGTTTTAAGTATAACATTAGATCTGCTTTGTTTGTTGTAAAGCAGAAGtGGTGTGTTATTGTTACTTTGTT 

i I I ■ ■ ■ ■ I I ■ • I ■ I .1. 1 I I ... I ... . 6164 

TGTCCACAATAAAACTGTGCATCAAAATTCATATTGTAATCTAGACGAAACAAACAACATTTCGTCTTCACCACACAATAACAATGAAACAA 

TGVILTRSFKYNIRSALFVVKQK, WCVIVTLF 
-— — — — ^— Replicase 1 a " 



TAAGTTCTTATTGTTATTATATGCTATTTATGCACTTGTTTTTATGATTGTGCAATTTAGTCCTTTTAATAGTCTTTTATGTGGTGACATTG 

I 1 I 1 i I ■ ■ — I 1 I • I , , ■ . t , ■ , , I 6256 

ATTCAAGAATAACAATAATATACGATAAATACGTGAACAAAAATACTAACACGTTAAATCAGGAAAATTATCAGAAAATACACCACTGTAAC . 

KFLLLLYAI YALVFMIVQFSPF. NSLLCGDl 
' RepHcase la 



TAA6TGGTTATGAAAAATCCACTTTTAATAAGGATATTTATTGTGGTAATTCTATGGTTTGTAA6ATGTGTTTGTTTAGTTATCAAGAGTTT 

■ \ I 1 ■ I I I I ■ ■ ■ . I ■ ■ ■ ■ I I 6348 

ATTCACCAATACTTTTTAGGTGAAAATTATTCCTATAAATAACACCATTAAGATACCAAACATTCTACACAAACAAATCAATAGTTCTCAAA 

VSGYEK STFNKD I YCGNSMVC KMCLFSYOEF 
. " — — Replicase la 



AATGATTTGGATCATACTAGTCTTGTTTGGAAGCACATTCGTGATCCTATATTAATCAGTTTACAACCATTT6TTATACTTGTTATTTTGTT 

■ I I I I 1 1 I I I I 6440 

TTACTAAACCTAGTATGATCAGAACAAACCTTCGTGTAAGCACTAGGATATAATTAGTCAAATGTTGGTAAACAATATGAACAATAAAACAA 

NOLOHTSLVWKHIROPiL lSLQPFVILV ILL 
——————— Replicase 1 a = 



AATTTTTGGTAATATGTATTTGCGTTTTGGACTTTTATATTTTGTTGCACAATTTATTAGTACTTTTG6TTCTTTCTTAGGCTTTCATCAGA 

, I ■ I ■ . ■ ■ I . ■ . ■ I I I I I I I I --+— 6532 

TTAAAAACCATTATACATAAACGCAAAACCTGAAAATATAAAACAACGTGTTAAATAATCATGAAAACCAAGAAAGAATCCGAAAGTAGTCT 

IFGNMYLRFGLLY'FVAQFISTFGSFLGFHQ 
Replicase la 



AACA6TGGTTTTTACATTTTGTGCCGTTTGATGTTTTATGTAATGAGTTTTTA6CTACATTTATTGTCTGCAAAATTGTTTTATTTGTTAGA 

I t I I I I I I ■ ■ ■ I 6624 

TTGTCACCAAAAATGTAAAACACGGCAAACTACAAAATACATTACTCAAAAATCGAT6TAAATAACAGACGTTTTAACAAAATAAACAATCT 

KOWFLHFVPFDVLCNEFLATFIV'CKIVLFVR 
—:————-————-—— —————^ — Replicase 1a ' 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



9/87 



PCT/NL2004/000805 



CATATTATTGTTGGCTGTAATAATGCTGACTGTGTAGCTTGTTCTAAAAGTGCTAGACTTAAACGTGTACCACTTCAAACTATTATTAATGG 

" " ■ 1 I 1 1 1 ' ' " » • 1 \ I ■ ■ ■ I I I 6716 

GTATAATAACAACCGACATTATTACGACTGACACATCGAACAAGATTTTCACGATCTGAATTTGCACATGGTGAAGTTT6ATAATAATTACC 

HI IVGCNN AOCVACSKSARLK RVP.LQTI I NG 
■ Repllcase 1 a — • •■ 



TATGCATAAATCATTCTATGTTAATGCTAATGGTG6TACTTGTTTCTGTAATAAACATAACTTCTTTTGTGTTAATTGTGATTCTTTTG6GC 

■ ' ' I ' ■ ■ ' I ■ I ■ I I I 1 1 I I I I ' ' I I • 6808 

ATAC6TATTTAGTAA6ATACAATTACGATTACCACCATGAACAAAGACATTATTTGTATTGAAGAAAACACAATTAACACTAAGAAAACCCG 

MHKSFYVNANGGTCFCNKHNFFCVNCOSFG 

Repiicase la ' — 



CTGGTAATACTTTTATTAATGGTGATATTGCAAGAGAGCTTGGTAATGTTGTTAAAACAGCTGTTCAACCCACAGCT.CCTGCATATGTTATT 

'I I ' I I I I I I I h 6900 

6ACCATTATGAAAATAATTACCACTATAACGTTCTCTCGAACCATTACAACAATTTTGTC6ACAAGTTGGGTGTCGAGGACGTATACAATAA 



PGNTF I NGD IARELGNVVKTAVQPTAPAYVI 

Replicase 1a 

ATTGATAAGGTAGATTTTGTTAATGGATTTTATCGTCTTTATA&TGGTGACACTTTTTGGCGGTATGACTTTGACATTACTGAATCTAAGTA 

I ■ - I 1 ' ' I ■ ■ ■ I ' I I ■ ■ ' I I I I 6992 

TAACTATTCCATCTAAAACAATTACCTAAAATAGCAGAAATATCACCACTGTGAAAAACCGCCATACTGAAACTGTAATGACTTAGATTCAT 

I DKVD FVNGFYRLYSGDTFWRYDF D I TESKY 
Replicase la 



TAGTTGTAAAGAGGTTCTGAAGAATTGTAATGTTTTAGAAAATTTTATTGTTTACAATAATAGTGGTAGTAACATTACACAGATTAAAAATG 

I I I I .■■. , ■>■■ I I , ■ i ■ . ■ ■ I ■ ■ ■ r , . . . , I ;i I , , ■ , 7084 

atcaacatttctccaa6acttcttaacattacaaaatcttttaaaataacaaatgttattatcaccatcattgtaatgtgtctaatttttac 

sckevlkncnvlenfivynnsgsni tqikn 
• Replicase 1 a ■ . 



CTTGTGTTTATTTTTCTCAATTGTTGTGTGAACCTATAAAGTTGGTAAATTCAGAGTTGTTGTCAACTTTATCA6TTGATTTTAATGGTGTT 

I I I I I , , , ■ i , . . . I ■ . . ■ I I ■ ... I ■ ... I . ... I ■ 7176 

6AACACAAATAAAAAGAGTTAACAACACACTTGGATATTTCAACCATTTAA6TCTCAACAACAGTTGAAATAGTCAACTAAAATTACCACAA 



ACVYFSQLLCEP IKLVNSELLSTLSVDFNGV 

— —————— Replicase 1 a 

TTGCATAAGGCATATGTTGATGTTTTGTGTAATAGTTTTTTTAAGGAGCTAACTGCTAACATGTCCAT66CTGAATGTAAAGCTACACTTGG 

I ' ' ' I 1 ' I ■ ' I II'' 1 I i I I .... I ... I I I I I I 7268 

AACGTATTCCGTATACAACTACAAAACACATTATCAAAAAAATTCCTCGATTGACGATTGTACA6GTACCGACTTACATTTCGAT6TGAACC 



LHKAYVDVLCNSFFKELTANMSMAECKATLG 

, Replicase la = 



TTTGACT6TTTCTGATGATGATTTTGTTTCAGCT6TTGCCAATGCACATAGGTATGACGTTTTGCTTTCAGATTTGTCATTTAATAATTTIT 

■ I I I I I ■ I I ■ ■ ■ I I ■ ■ ■ ■ t ■ I I I ■ I I I 7360 

AAACTGACAAAGACTACTACTAAAACAAA6TC6ACAACGGTTACGTGTATCCATACTGCAAAACGAAAGTCTAAACAGTAAATTATTAAAAA 

LTVSDDDF VSAVANAHRYDV.LLSOLSFNNF 
- — Replicase la 



TTATTTCTTATGCTAAACCTGAAGATAAGTTGTCC6TTTATGACATT6CTTGTTGTATGC6TGCCGGTTCTAAGGTTGTTAACCATAATGTT 

^. ■ I 1 I I I ■ I ■ ■ I ' • I I I ' I ■ ' 7452 

AATAAAGAATAC6ATTTGGACTTCTATTCAACAG6CAAATACTGTAACGAACAACATAC6CACGGCCAAGATTCCAACAATTGGTATTACAA 



F ISYAKPE.OKLSVYDI ACCMRA6. SKVVNHNV 
■ Replicase 1 a ■ 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

10/87 



TTAATCAAAGA6TCAATACCTATTGTTTGGGGTGTCAAGGACTTTAATACTCTTTCTCAAGAAG6TAAGAAGTACCTTGTTAAAACAACTAA 

1 1 1 1 1 1 H4- 1 ^ 1 H ' " I ■ ■ ' I I i ■ ■ ■ ■ 7544 

AATTAGTTTCTCA6TTATGGATAACAAACCCCACAGTTCCTGAAATTATGA6AAAGAGTTCTTCCATTCTTCATGGAACAATTTTGTTGATT 

L I KES I P i VWGVKOF NTLSQEGK k'yLVKT TK 
Replicase la • 



AGCAAAGGGTTTGACTTTTTTATTAACTTTTAATGATAACCAAGCAATTACACAAGTTCCTGCTACTAGTATAGTTGCAAAACAGGGTGCTG 

I I I I I ... I ■ ... I I I I ■ . 1 .11 I I i ■ ■ ■ ■ I . 7636 

TCGTTTCCCAAACTGAAAAAATAATT6AAAATTACTATTGGTTCGTTAAT6TGTTCAAGGACGATGATCATATCAACGTTTTGTCCCACGAC 

AKGLTFLLTFN ONQAI TOVPATS IVAKQG A 

———— Repllcase 1 a — — — — — — — — — — -— 



GTTTTAAACGTACTTATAATTTTCTGTGGTATGTATGTTTATTTGTTGTTGCATTGTTTATTGGTGTCTCATTTATTGATTATACAACCACT 

1 1 M 1 ' » ■ ■ I 1 I I ■ ■ I ■ ■ I ■ ■ ■ I 7728 

CAAAATTTGCATGAATATTAAAAGACACCATACATACAAATAAACAACAACGTAACAAATAACCACAGAGTAAATAACTAATATGTTGGTGA 

GFKRTYNFLWYVCLFVVALF I GVSF lOYTTT 
^— ■— — ^— — — — — — Replicase 1 a — — 



GTAACTAGCTTTCATGGTTATGATTTTAAGTACATTGAGAATGGTCAGTTGAA6GTGTTTGAAGCACCTTTACACTGTGTTCGTAATGTTTT 

■I 1 I , ■ I I I I 1 I ■ . > . . ■ 1 I 7820 

CATTGATC6AAAGTACCAATACTAAAATTCAT6TAACTCTTACCAGTCAACTTCCACAAACTTC6T6GAAATGT6ACACAAGCATTACAAAA 

VTSFHGYDFKY I ENGQLKVFEAPLHCVRNVF 
■ Replicase 1 a . 



TGATAATTTTAATCAATGGCATGAGGGTAAGTTTGGTGTTGTTACTACTAATAGTGATAAATGTCCTATAGTTGTTGGTGTTTCAGAGCGTA 

^ I I I ■ > I I I- ' I • ■ I ■ ' ■ ■ ■ I ■ . 7912 

ACTATTAAAATTA6TTACCGTACTCCGATTCAAACCACAACAATGATGATTATCACTATTTACAGGATATCAACAACCACAAAGTCTCGCAT 

DNFN.QWHEAKFGVVTTNSDKCP. I VVGVSER 
Replicase la r-- 



TTAATGTTGTTCCTGGTGTTCCAACAAATGTATATTTGGTAGGAAAGACTCTtGTTTTTACATTACAGGCTGCTTTTGGAAACACAGGTGTT 

— I I . I ■ ■ ■ I ■ ■ ■ ■ i ■ ■ I ■ ■ ■ I I I I ■ ■ ■ i I I ■ ' ■ ■ 8004 

AATTACAACAAGGACCACAAG6TTGTTTACATATAAACCATCCTTTCTGAGAACAAAAATGTAAT6TCCGACGAAAACCTTTGTGTCCACAA 

I NVVPGyPTNVYLVGKTLVFTLOAAFGflTGV 

—————— Replicase 1 a —————— — 



T6TTATGACTTTGATGGTGTTACCACTAGTGATAAGTGTATTTTTAATTCTGCTTGTACTAG6TTGGAAGGTTTGG6TG6TGACAATGTTTA 

I I h i ' I I . ■ ■ ■ I ■ ■ ■ I I I ■ I i 8096 

ACAATACT6AAACTACCACAATGGTGATCACTATTCACATAAAAATTAAGACGAACATGATCCAACCTTCCAAACCCACCACTGTTACAAAT 

CYOFDGV T TSDKC IFNSACTRLE6LGG0NVY 
Replicase la 



TTGTTACAACACTGATCTTATTGAAGGTTCTAAACCTTATAGTATTTTACAGCCCAATGCTTATTATAAGTATGATGTTAAAAATTATGTAC 

' ■ ■ I ■ I 1 'I 1 I I I I I 8188 

AACAATGTTGTGACTAGAATAACTTCCAAGATTTGGAATATCATAAAATGTCGGGTTACGAATAATATTCATACTACAATTTTTAATACATG 

CYNTDL I EGSKPYSILQPNAYYKYDVKNYV 
• Replicase .1 a 



GTTTTCCAGAAATTTTAGCTAGAGGTTTTGGCTTACGTACTATTAGAACTTTGGCTACACGTTATTGTAGAGTTGGTGAATGCCGTGACTCA 

■ I I I I i I ' I I I I 1 ' ' ■ ■ » I I I ' ' ' I ■ ' ' I 8280 

CAAAAGGTCTTTAAAATC6ATCTCCAAAACC6AATGCATGATAATCTT6AAACCGATGT6CAATAACATCTCAACCACTTACGGCACTGA6T . 

RFPEiLARGFGLRTlRTLATRYCRVGECRDS 

— Replicase 1a 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

11/87 



CATAAAGGTGTTTGTTTTGGTTTTGATAAATGGTATGTTAATGATGGACGTGTTGATGACGGTTACATTTGTGGTGATGGTCTTATAGACCT 

I I ' ' ' I 1 1 ■ I ■ ■ I ■ I I I I > 1 I I 1 ■ ■ I ■ ■ 8372 

GTATTTCCACAAACAAAACCAAAACTATTTACCATACAATTACTACCTGCACAACTACT6CCAATGTAAACACCACTACCAGAATATCTGGA 

HK6VCFGFDKWYVNDGRVD0GY I CGDGL I DL 
Repiicase la 



TCTT6TTAATGTACTCTCAATCTTTA6TTCATCTTTTAGCGTTGTGGCTATGTCTGGACATATGTTGTTTAATTTTCTTTTTGCAGCATTTA 

I ■ ■ I I ■ ■ ■ I ■ ■ ■ • ■ I I I \ I 'I I I ' ■ ■ 8464 

AGAACAATTACATGAGAGTTAGAAATCAAGTAGAAAATCGCAACACC6ATACA6ACCTGTATACAACAAATTAAAAGAAAAACGTCGTAAAT 

LVNVLS IFSSSFSVVAMSGHMLFNFLFAAF 

— ^ — ■ ^ Repiicase la 



TTACATTTTTGTGCTTTTTAGTTACTAAATTTAAACGTGTTTTTGGTGATCTTTCTTATGGTGTTTTTACTGTTGTTTGTGCAACTTTGATT 

I ■ I I I ■ ■ ■ I 11 1 I I I I ■ ' I I I ■ ■ ■ ■ I ■ 8556 

AATGTAAAAACAC6AAAAATCAATGATTTAAATTTGCACAAAAACCACTAGAAAGAATACCACAAAAATGACAACAAACAC6TT6AAACTAA 

ITFL C FLVTKFKRVFGDLSYGVFTVVCATLI 
' — Repiicase 1 a — — — — — — — 



AATAACATTTCTTATGTTGTTACTCAAAATTTATTTTTTATGTTGCTTTATGCTATTTTGTATTTT6TTTTTACTAGGACAGTGCGTTATGC 

. . . I . I ■ I I I I I I I I I I I I . ■ . ■ I ■ ■ ■ 8648 

TTATTGTAAAGAATACAACAATGAGTTTTAAATAAAAAATACAACGAAATACGATAAAACATAAAACAAAAATGATCCTGTCACGCAATACG 

NNISYVVTQNLFFMLLYAILYFVFT. RTVRYA 

Repiicase la • 



TTGGATTTGGCATATTGCATACATTGTTGCATACTTCTTGTTAATACCATGGTGGCTTCTCACATGGTTTAGTTTTGCTGCATTTTTAGAGC 

•H I H 1 ■ ■ I ' ' ■ ' I i ' > I « 1 ■ • 1 ^ 1 ' H 8740 , 

AACCTAAACCGTATAACGTATGTAACAACGTATGAAGAACAATTATGGTACCACCGAAGAGTGTACCAAATCAAAACGACGTAAAAATCTCG 

WIWH lAY IVA. YFLLIPWWLLTWFSFAAFllE 

■ Repiicase 1a . ' 



TTTTACCTAATGTTTTTAAGTTAAAAATCTCTACTCAATTGTTTGAAGGTGATAAGTTTATAGGTACTTTTGAGAGTGCTGCTGCAGGTACA 

I ■ I i I I I I ' ' -I 1 I ■ ■ 8832 

AAAATGGATTACAAAAATTCAATTTTTAGAGATGAGTTAACAAACTTCCACTATTCAAATATCCATGAAAACTCTCACGACGACGTCCATGT 

LLPNVFK LK I STOL FEGDKF I.GTFESAAAGT 

' Repiicase 1 a 



TTTGTTCTTGACATGCGTTCTTATGAAAGGCTGATAAATACTATTTCACCTGAGAAACTTAAGAATTATGCTGCAAGTTATAATAAATATAA 

I ■ . I 1 H 1 1 1 ■■>''■■! 1 8924 

AAACAAGAACTGTACGCAAGAATACTTTCCGACTATTTATGATAAAGTGGACTCTTTGAATTCTTAATACGACGTTCAATATTATTTATATT 

FVLOMRSYERLINTISPEKLKNYAASYNKYK 
' — Repiicase 1a 



ATATTATAGTGGTAGTGCTAGTGAGGCTGATTATCGTTGTGCTTGTTATGCTCATTTAGCCAAGGCTATGTTAGATTACGCAAAAGATCATA 

I i \ I I . ■ ■ ■ I I ■ I . ■ ■ I I I I ■ • ■ I I I I ■ ■ I ■ 9016 

TATAATATCACCATCACGATCACTCCGACTAATAGCAACACGAACAATACGAGTAAATCGGTTCCGATACAATCTAATGCGTTTTCTAGTAT 

yysgsaseaoyrcacy ah. LAKAMLOYAKDH 
-i Repiicase la 



ATGACATGTTATATTCTCCACCTACCATTAGCTACAATTCCACCTTACAATCTG6TCTTAAGAA6AT6GCACAACCATCTGGTT6T6TTGA6 

■ I 1 ■ I I ■ ■ ■ I 1 I ■ ■ ■ ■ I ■ ■ ■ ■ I I ■ t ■ I 9108 

TACTGTACAATATAA6AGGTGGATGGTAATCGAT6TTAAGGTGGAAT6TTAGACCAGAATTCTTCTACCGT6TTGGTAGACCAACACAACTC 

N0MLYSPPTISYNSTL0S6LKKMAOPS6CVE 
— Repiicase 1a 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/00080S 

12/87 



AGATGTGTGGTTCGCGTCTGTTATGGTAGTACTGTGCTTAATGGAGTTTGGTTAGGTGACACTGTTACTTGTCCTAGACATGTCATAGCACC 

I ' ' I ' ■ I 1 I I ■ I ■ I I I I I ■ ■ ■ ■ I I ■ ■ . i I 9200 

TCTACACACCAA6CGCAGACAATACCATCATGACACGAATTACCTCAAACCAATCCACT6TGACAATGAACAGGATCTGTACAGTATCGTG6 

RCVVRVCYGSTVLN6VWL6DTVTCPRHVIAP 

' Rep] lease 1 a — — — — — > 



ATCAACCACTGTTCTTATTGATTAT&ATCATGCATATA6TACTATGCGTTTGCATAATTTTTCAGTGTCTCATAATGGTGTCTTCTT6GGAG 

1 I I I I ' f 1 ■ ' t ■ ■ I I . ■ I ■ ■ ■ ■ I I ■ ■ 9292 

TAGTTGGTGACAAGAATAACTAATACTAGTACGTATATCATGATACGCAAACGTATTAAAAAGTCACAGAGTATTACCACAGAAGAACCCTC 

STTVLIDYOHAYSTMRLHNFSVSHNGVFLG 

Replicase la 



TTGTT6GTGTTACAATGCATGGTTCTGT6TTGCGTATTAAGGTTTCACAATCTAATGTACATACACCTAAACATGTTTTTAAAACGTTGAAA 

I ■ ■ . ■ I ■ ■ ■ ■ I I \ I I I ■ ■ ■ ■ I ■ ■ ■ ■ I II I I I ■ 938<) 

AACAACCACAATGTTACGTACCAAGACACAACGCATAATTCCAAAGTGTTA6ATTACAT6TATGT6GATTTGTACAAAAATTTTGCAACTTT 

VVGVTMHGSVLRIKVSQSNVHTPKHVFKTLK 
; Replicase la 



CCTGGTGCTTCTTTTAATATTTTAGCAT6TTATGAAGGTATT6CATCTGGTGTTTTTGGTGTTAATTTACGTACAAACTTTACTATTAAAGG 

H 1 ■ . ■ , I 1 1 1 1 i I I ■ ■ ■ I I . ■ I I 1 9476 

GGACCACGAAGAAAATTATAAAATCGTACAATACTTCCATAACGTAGACCACAAAAACCACAATTAAATGCATGTTT6AAATGATAATTTCC 

PGASFNILACYEGIASGVFGVNLRTNFTIKG 

Replicase la 



TTCTTTTATAAATGGAGCTTGTGGTTCTCCTGGTTATAATGTTAGAAATGATGGTACTGTTGAGTTTTGTTATTTACACCAAATTGAGTTAG 

■ ' ( I I ' I ■ ' ' I I I ' ' * ' ' ' t I q I i 9568 

AAGAAAATATTTACCTCGAACACCAAGAG6ACCAATATTACAATCTTTACTACCATGACAACTCAAAACAATAAATGTGGTTTAACTCAATC 

SF I NGACGSPGYNVRNOGT VEFCYLHQ I EL 

• Replicase 1 a ' 



GTA6TGGTGCTCATGTTGGTTCTGATTTTACT6GTAGTGTTTATGGTAATTTTGAT6ACCAACCTAGTTTGCAAGTTGAGAGTGCCAACCTT 

■ I I I I I I I I I I I 9660 

CATCACCACGA6TACAACCAAGACTAAAAT6ACCATCACAAATACCATTAAAACTACTGGTTGGATCAAACGTTCAACTCTCACGGTTGGAA 

GSGAH VGSOFTGSVYGNFO DQPSL OVESANL 

. Replicase 1a 



ATGCTATCAGATAATGTTGTTGCCTTTTTGTATGCTGCTTTGTTGAATG6TTGTAGGTGGTGGTTGCGTTCAACTAGA6TTAATGTTGATGG 

I I I I I I I I I ■ ■ 9752 

TACGATAGTCTATTACAACAACGGAAAAACATACGACGAAACAACTTACCAACATCCACCACCAACGCAAGTTGATCTCAATTACAACTACC 

MLSONVVAFLYAALLNGCRWWLRSTRVNVOG 

Replicase 1a 



TTTTAATGAATGGGCTATGGCTAATGGTTATACAATTGTTTCTAGTGTTGAGTGCTATTCTATTTTGGCAGCAAAAACTGGTGTTAGTGTTG 

■>■ I I I I ■ . . , I I . ■ ■ I 1 1 I ' ■ I I I ■ I I ■ ■ ■ 9844 

AAAATTACTTACCCGATACCGATTACCAATATGTTAACAAAGATCACAACTCACGATAAGATAAAACCGTCGTTTTTGACCACAATCACAAC 

FNEWAMANGY T I VSSVECYS I LAAK T6VSV 
——————— Replicase 1 a — — 



AACAATTGTTAGCTTCCATTCAACATCTTCATGAAGGTTTTGGTGGTAAAAACATACTTGGTTATTCTAGTTTATGTGATGAGTTCACACTA 

I I I I 1 ■ ■ I ■ > ■ ■ t I I ' I ■ > I H-^ 1 I ' 9936 

TTGTTAACAATCGAAGGTAAGTTGTAGAAGTACTTCCAAAACCACCATTTTTGTATGAACCAATAAGATCAAATACACTACTCAAGTGTGAT 

EQLLASIQHLHEGFGGKNILGYSSLCDEFTL 

Replicase la 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



13/87 



PCT/NL2004/000805 



GCTGAAGTTGTGAAGCAGAT6TATGGT6TTAACTTGCAAAGTGGTAAG6TTATTTTTG6TTTAAAAACAATGTTTTTATTTAGC6TTTTCTT 

1 1 i I I ' ■ ' ' I • ■ I I I I ... I ■ ■ ■ , , , I I I , I ^0028 

C6ACTTCAACACTTC6TCTACATACCACAATTGAAC6TTTCACCATTCCAATAAAAACCAAATTT.TTGTTACAAAAATAAATCGCAAAAGAA 

AEVVKQMYGVNLQSGKV I FGLKTMFLFSVFF 
^ Repllcase la 



CACAATGTTTTGGGCAGAACTCTTTA^TTTATACAAACACTATATGGATAAACCCTGTTATACTTACACCTATATTTTGTTTACTTTTGTTTT 

■I I I I ■ ■ ■ ■ I I I i I I 10120 

GTGTTACAAAACCCGTCTTGAGAAATAAATATGTTTGTGATATACCTATTTGGGACAATATGAATGTGGATATAAAACAAATGAAAACAAAA 

TMFWAELFIYTNTIWIN .PVILTPIFCLLLF 
— — — ^ — Repltcase la 



TGTCATTAGTTTTAACTATGTTTCTTAAACATAAGTTTTTGTTTTTGCAAGTATTTTTATTACCTACTGTTATTGCAACTGCTTTATATAAT 

.^^^ 1 «H 1 I I .1 1 I 1 I I .1 I ■ ■ 10212 

ACAGTAATCAAAATTGATACAAAGAATTTGTATTCAAAAACAAAAACGTTCATAAAAATAATGGATGACAATAACGTTGACGAAATATATTA 

LSLVLTMFLKHKFLFLOVFLLPTVIATALY N 
• — _ Replicase 1 a . — ' 



TGTGTTTTGGATTATTACATAGTAAAATTTTTGGCTGACCATTTTAACTATAATGTTTCAGTATTACAAATGGATGTTCAGGGTTTAGTTAA 

I I . . ■ I I ■ . I I ■ ■ ■ ■ I ■ ■ ■ I I 1 I ■ ! i 1 ■ ' ' ' 1030^ 

ACACAAAACCTAATAATGTATCATTTTAAAAACC6ACTGGTAAAATTGATATTACAAAGTCATAATGTTTACCTACAA6TCCCAAATCAATT 

CVUOYYIVKFLAOHFNYNVSVLQMDVQGLVN 

— — ^ — — Repltcase 1a '• 



TGTTTTGGTCTGTTTATTTGTTGTATTTTTACACACATGGCGTTTTTCTAAAGAACGTTTCACACATTGGTTTACATATGTGTGTTCTCTTA 

I I I I . I . I ■ » I ^ I ■ ■ I ' I ^ 10396 

ACAAAACCA6ACAAATAAACAACATAAAAATGTGTGTACC6CAAAAAGATTTCTTGCAAAGTGT:GTAACCAAATGTATACACACAAGAGAAT 

VLVCLFVVFLHTWRFSKERFTH WFTYVCSL 

Repilcaste la .• 



TAGCAGTTGCTTACACTTATTTTTATAGTGGTGACTTTTTGAGTTTGCTTGTTATGTTTTTATGTGCTATATCTAGTGATTGGTACATTGGT 

■ I ■ ■ ■ ■ I I I I I ■ ■ > I ■ ■ . ■ I I ■ ■ I i I I 10488 

ATCGTCAACGAATGTGAATAAAAATATCACCACTGAAAAACTCAAACGAACAATACAAAAATACACGATATAGATCACTAACCATGTAACCA 

I AVAYTYFYSGDFLSLLVMF.LCA I SSDWY I 6 

' ^ Replicase la 



GCCATTGTTTTTAGGTTGTCACGTTTGATTATATTTTTTTCACCTGAAAGTGTATTTAGTGTTTTTGGTGATGTGAAACTCACTTTAGTTGT 

-H ■ I ■ ■ I I 1 I I I I 1 ■ ■ I I 10580 

CGGTAACAAAAATCCAACAGTGCAAACTAATATAAAAAAAGTGGACTTTCACATAAATCACAAAAACCACTACACTTTGAGTGAAATCAACA 

aivfrlsrli IFFSPESVFSVFGOVKLTLVV 

Replicase la 



TTATTTAATTTGTGGTTATTTAGTTTGTACTTATTGGGGCATTTTGTATTGGTTCAATAGGTTTTTTAAATGTACTATGGGTGTTTATGATT 

^ 1 1 I 1 1 ■ ■ ■ ■ I 1 1 1 1 1 1 -H H 1 106^2 

AATAAATTAAACACCAATAAATCAAACATGAATAACCCCGTAAAACATAACCAAGTTATCCAAAAAATTTACATGATACCCACAAATACTAA 

YLICGYLVCTYWGILYWFNRFFKCTMGVYD 

Replicase la - 



TTAAGGTGAGTGCTGCTGAATTTAAATACATGGTTGCTAATGGACTTCATGCACCATATGGACCTTTTGATGCACTTTGGTTATCATTCAAA 

I I I I ■ ' ' I I I I I ■ ■ t ■ I 10764 

AATTCCACTCAC6AC6ACTTAAATTTATGTACCAACGATTACCT6AAGTACGTGGTATACCTGGAAAACTACGTGAAACCAATAGTAA6TTT 



FKVSAAEFKYMVANGLHAPYGPFDALWLSFK 

Replicase la^ — 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

14/87 



TTACTTGGTATTGGTGGTGACCGTTGTATAAAAATTTCAACTGTCCAATCCAAACTGACTGATTTGAAGTGTACTAATGTTGTGTTATTGGG 

» I ■ ■ ■ ■ I I ' ' I ■ ' « ' » » I I I ■ ■ I , , ■ . 1 10866 

AATGAACCATAACCACCACTGGCAACATATTTTTAAAGTTGACAGGTTAGGTTT6ACT6ACTAAA.CTTCACAT6ATTACAACACAATAACCC 

LLGIGGDRCIK ISTVQSKLTDLKCTNVVLLG 

• RepHcase 1a — — — 



TTGTTTGTCTAGTATGAACATTGCAGCTAATTCTAGTGAATGG6CTTATTGTGTT6ATTTACACAATAAGATTAATCTTTGTGATGACCCAG 

■ ' ' I ' ■ ' ■ I ' ' ' I I I I . ■ ■ ■ i ■ ■ ■ I I. I ^ 10948 

AACAAACAGATCATACTTGTAACGTCGATTAAGATCACTTACCCGAATAACACAACTAAATGTGTTATTCTAATTAGAAACACTACTGGGTC 

CLSSMN I AANSSEWAYCVDLHNK INLCODP 
— ^ — Replicase la 



AAAAAGCTCAAGGTATGTT6TTAGCACTCCTT6CGTTCTTTCTAA6TAAACATAGTGATTTTGGTCTTGATGGCCTTATTGATTCTTATTTT 

\ H ■ I ■ ■ ■ I I i i ' I ' ' I -I "I I 11040 

TTTTTCGA6TTCCATACAACAATCGTGAG6AACGCAAGAAAGATTCATTTGTATCACTAAAACCAGAACTACC6GAATAACTAAGAATAAAA 

EKAQGMLLALLAFFL SKHSDFGLDGL I DSYF 
— ^— — ^— — ^— Replicase 1 a ' 



GATAATAGTAGCACCCTGCAGAGT6TTGCTTCATCATTTGTTA6TATGCCATCATATAT.TGCTTATGAAAATGCTAGACAAGCTTATGAGGA 

I I I I 1 1 1 i I I • ■ 1 ■ ■ 11132 

CTATTATCATCGTGGGACGTCTCACAACGAAGTAGTAAACAATCATAC6GTAGTATATAAC6AATACTTTTACGATCTGTTC6AA.TACTCCT 

ONSSTLQSVASSFVSMPSY lAYE NARQAYEO 
Replicase 1 a ' 



TGCTATT6CTAATG6ATCTTCTTCTCAACTTATTAAACAATTGAAGCGTGCCATGAATATCGCAAAGTCTGAATTTGATCATGAGATATCTG 

— I ■ I I i [ I 1 ■ * I H I 11224 

ACGATAACGATTACCTAGAAGAAGAGTTGAATAATTTGTTAACTTCGCACGGTACTTATAGCGTTTCA6ACTTAAACTAGTACTCTATAGAC 

AlANGSSSQL IKQLKRAMN lAKSEFOHE I S 
Replicase 1 a ' 



TTCAGAAGAAAATTAATAGAATGGCTGAACAAGCT6CTACTCAGATGTATAAAGAAGCACGCTCTGTTAATAGAAAATCTAAAGTTATTAGT 

I 1 I i I , ■ ■ , I ■ ■ ■ . I I I I I ■ 1 11316 

AAGTCTTCTTTTAATTATCTTACCGACTTGTTCGACGATGAGTCTACATATTTCTTCGTGCGAGACAATTATCTTTTAGATTTCAATAATCA 

VQKK I NRNA EQAATQMYKEARSVNRKSKV 1 S 

Replicase la 



GCTATGCACTCTTTACTTTTTGGAAT6TTAAGAC6TTTGGATATGTCTAGT6TTGAAACTGTTTTGAATTTAGCACGTGATGGT6TTGTGCC 

■ ■ ' I I I I I I ' ' 1 I 11408 

CGATACGTGAGAAATGAAAAACCTTACAATTCTGCAAACCTATACAGATCACAACTTTGACAAAACTTAAATCGTGCACTACCACAACACGG 

AMHSLLFGMLRRLDMSSVETVLNLAROGVVP 

Replicase 1a 



ATTGTCAGTTATACCTGCAACTTCAGCTTCCAAACTAACTATTGTTAGTCCAGATCTTGAATCTTATTCTAAGATTGTTTGTGATGGTTCTG 

■ I ' I ' ■ I I ■ ' i I I I i I » 11500 

TAACAGTCAATATGGACGTTGAAGTC6AAGGTTTGATTGATAACAATCAGGTCTAGAACTTAGAATAAGATTCTAACAAACACTACCAA6AC 

LSV I PAJSASKLT 1 VSP DLESYSK I VCOGS 
— — — Replicase 1 a ^— —————— —————— —— 



TTCATTAT6CT66AGTTGTTTGGACACTTAATGAT6TTAAAGACAATGATGGTAGACCTGTTCAT6TTAAA6AGATTACAA666A6AATGTT 

I I I I ■ ■ I I I I 1 ■ ■ ■ I I I I ' ■ 11592 

AAGTAATACGACCTCAACAAACCTGT6AATTACTACAATTTCTGTTACTACCATCTGGACAAGTACAATTTCTCTAATGTTCCCTCTTACAA 

V-HYAGVVWTLNOVKONDGRPVHV KE I TRENV 
■ Replicase 1 a ' 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

15/87 



GAAACTTTGACATGGCCTCTTATCCTTAATTGT6AACGTGTTGTTAAACTTCAAAATAAT6AAATTATGCCTGGTAAACTTAAGCAAAAACC 

I ■ I I ■ I ■ ■ ■ ■ I i I ■ ■ ■ I I . 1 : t ■ ■ ■ I .III: I I ■ I ■ I I ■ . ■ ■ 11684 

CTTTGAAACT6TACCGGAGAATAGGAATTAACACTTGCACAACAATTTGAAGTTTTATTACTTTAATACGGACCATTTGAATTC6TTTTT6G 

ETLTWPUILNCERVVKLQNNEIMPGKLKQKP 
— —————— —r-=—— Repilcase 1 a '— — ■ 



TATGAAAGCT6AGGGTGATGGTGGTGTTTTAGGT6ATGGTAATGCTTTGTATAATACTGAGGGTGGTAAAACTTTTAT6TATGCTTATATTT 

« ' ■ ■ i I I ■ ■ ■ I I I 1 I ■ i ■ I ■ ■ I 11776 

ATACTTTCGACTCCCACTACCACCACAAAATCCACTACCATTACGAAACATATTATGACTCCCACCATTTTGAAAATACATACGAATATAAA 

MKAEGDGGVLGDGNALYNT. EG6KTFMYAYI 
' Replicase 1 a — 



CTAATAAAGCTGACCTTAAATTTGTTAAGTGGGAGTATGAGGGTGGTTGCAACACAATCGAGTTAGACTCTCCTTGTCGATTTATGGTCGAA 

■ ■ ■ I I I I I I ' ' ■ ■ I 1 ■ ■ ■ 1 11868 

GATTATTTCGACTGGAATTTAAACAATTCACCCTCATACTCCCACCAACGTTGTGTTAGCTCAATCTGAGAG6AACAGCTAAATACCAGCTT 

SNKADLKFV KWEYEGGCNTIELDSPCRFMVE 

Replicase la 



ACACCTAATGGTCCTCAAGTGAAGTATTTGTATTTTGTTAAAAATTTAAATACCTTACGTAGAGGTGCCGTTCTTGGTTTTATAGGTGCCAC 

■ I ' ' ■ ■ I I ■ ■ ■ I . ■ ' I i t I I ' ' » I - I 11960 

TGTGGATTACCAGGAGTTCACTTCATAAACATAAAACAATTTTTAAATTTAtGGAATGCATCTCCACGGCAAGAACCAAAATATCCACGGTG 

TPNGPQVKYL.YFVKNLNTLRRGAVLGF I GAT 

Replicase la 



AATTCGTCTACAAGCTGGTAAACAAACTGAATTG6CTGTTAATTCTGGACTTTTAACTGCTTGTGCTTTTTCTGTTGATCCAGCAACCACTT 

■ ' ■ ■ t ■ ■ ■ ■ I i I \ » — ' " I 1 • > ' ■ ' ■ ' ' ■ I 12052 

TTAAGCAGATGTTCGACCATTTGTTTGACTTAACCGACAATTAAGAeCTGAAAATTGACGAACACGAAAAAGACAACTAGGTCGTTGGTGAA 

IRLQAGKOTEL AVNSGLLTACAFS VDPATT 

Repilcase la ' 



ACTTG6AAGCTGTTAAACATGGTGCAAAACCT.GTAAGTAATTGTATTAAGATGTTATCTAATGGTGCT6GTAATGGTCAAGCTATAACAACT 

I , ■ ■ ■ j ■ . ■ ■ I I . ■ I I I ■ , ■ ■ 1 1 . ■ ■ I I I ■ ■ ■ , t2ia4 

TGAACCTTC6ACAATTT6TACCACGTTTTGGACATTCATTAACATAATTCTACAATAGATTACCACGACCATTACCAGTTCGATATTGTTGA 

YLEAVKHGAKPVS NCIKMLSNGAGNGOAI TT 

Replicase la 



AGTGTAGATGCTAACACCAATCAAGATTCTTAT66TGGAGC6TCTATTTGTTTGTATTGTCGGGCCCACGTTCCTCACCCTAGTAT6GATGG 

I I 1 ■ ■ ■ ■ I I I ' ■ ■ ■ I 1 I I ' 12236 

TCACATCTACGATT6T6GTTAGTTCTAAGAATACCACCTCGCAGATAAACAAACATAACA6CCCGGGT.GCAAGGAGTGGGATCATACCTACC 

SVDANTNQDSYGGASI CLYCRAHVPHPSMDG 
Replicase 1a 



TTACTGTAAGTTTAAGGGTAAATGTGTTCAGGTTCCTATTGGTTGTTTGGATCCTATTAGGTTTTGTTTAGAAAATAATGTGTGTAATGTTT 

■ ■ I I 1 i I 'I ( ■ ■ . . I ■ ■ ■ I I I . I I 12328 

AATGACATTCAAATTCCCATTTACACAAGTCCAAGGATAACCAACAAACCTAGGATAATCCAAAACAAATCTTTTATTACACACATTACAAA 

YCKFKGKCVQVPIGCLDPIRFCLENNVCNV 

Repilcase 1a 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

16/87 



GTGGTTGTTG6TTG6GACACG66TGTGCTT6TGATCGTACAACCATTCAAAGTGTTGACATTTCTTATTTAAACGAGCAAGGG6TTCTAGTG 

1 I ' ■ » ' I ' ' 1 ' ' I ■ I ■ ■ ■ ' I I I ■ ■ I 12420 

CACCAACAACCAACCCTGTGCCCACACGAACACTAGCATGTTGGTAA6TTTCACAACTGTAAAGAATAAATTT6CTCGTTCCCCAAGATCAC 

CGCWLGHGCACORT T ! OSVO I S.Y L ISIEOGVLV 

.. Replicase la ~ • 

, R A R G S S 
' Replicase lb 

CAGCTCGACTAGAACCCTGTAATGGCACG6ACATCGATAAGTGTGTTCGTGCTTTTGACATTTATAATAAAAATGTTTCATTCTTGG6TAAG 

■ ■ I ■ I ■ I I I \ I 1 ■ ■ I I ■ I I i " 12512 

GTCGAGCT6ATCTTGGGACATTACC6TGCCT6TAGCTATTCACACAAGCACGAAAACT6TAAATATTATTTTTACAAAGTAAGAACCCATTC 

OLD., 
-Replicase la -J 

AARLEPCNGTO I OKCVRAFD I YNKNVSFLGK 

Replicase lb = '• ' 



TGTTTGAAGATGAACTGTGTTCGTTTTAAAAATGCTGATCTTAAGGAT6GTTATTTTGTTATAAAGAGGTGTACTAAGTCGGTTATGGAACA 

, 1 1 1 H 1 1— I ■ 1 \ ^ 1 • 1 i 1 12604 

ACAAACTTCTACTTGACACAAGCAAAATTTTTACGACTAGAATTCCTACCAATAAAACAATATTTCTCCACATGATTCAGCCAATACCTTGT 

CLKMNCVRFKNADLKDGYFVIKRCTKSVMEH 

Replicase lb 



CGAGCAATCCATGTATAACCTACTTAACTTTTCTGGTGCTTTGGCTGAGCATGATTTCTTTACTTGGAAAGATGGCAGAGTCATTTATGGTA 

H 1 1 ■ ... I 1 1 t ' I • i I ' ' ' ' ' ' \ I ' ■ ■ ■ I 12696 

GCTCGTTA6GTACATATTGGATGAATTGAAAAGACCACGAAACC6ACTC6TACTAAAGAAATGAACCTTTCTACCGTCTCAGTAAATACCAT 

EQSMYNLLNFSGALAEHDFFTWKDGRV I YG 
Replicase lb ' 



ATGTTAGTAGACATAATCTTACTAAATATACTATGATGGACTTGGTTTATGCTATGCGTAACTTTGATGAACAAAATTGTGATGTTCTAAAA 

^^^H 1 I ' I ■ ■ ■ ■ I ■ ■ ■ ■ I I 1 I I ■ ■ : . ■ i . ■ ■ ■ I 12788 

TACAATCATCTGTATTAGAATGATTTATATGATACTACCTGAACCAAATACGATACGCATTGAAACTACTTGTTTTAACACTACAAGATTTT 

NVSRHNLTKYTHHOLVYAHRNFOEQNCDVLK 
— — — — — — — Replicase 1 b — 



GAAGTATTAGTTTTAACTGGTTGTTGTGACAATTCTTATTTT6ATAGTAAGGGTT6GTATGACCCAGTTGAAAATGAAGATATACATAGAGT 

■ I I ■ . ■ I ■ ■ ■ I I ' ■ ■ . ■ I I I I , , ■ ■ I . ■ ■ . I 12880 

CTTCATAATCAAAATTGACCAACAACACTGTTAAGAATAAAACTATCATTCCCAACCATACTG6GTCAACTTTTACTTCTATAT6TATCTCA 

EVLVLTGCCONSYFOSKGWYOPVENEOI HRV 

Replicase lb 



TTATGCATCTCTTGGCAAAATTGTAGCTA6AGCTATGCTTAAATGC6TT6CTCTATGTGATGCGATGGTTGCTAAAGGTGTT6TT66T6TTT 

■ ■ ■ I ' I I ■ ■ 1 ' ' ■ ■ I I I I \ ■ I ■ ' I I 12972 

AATACGTA6AGAACCGTTTTAACATCGATCTCGATACGAATTTACGCAACGAGATACACTACGCTACCAAC6ATTTCCACAACAACCACAAA 

YASLGK IVARAMLKCVALCDAMVAKGVVGV 
— Replicase lb 



TAACATTAGATAACCAAGATCTTAATGGTAACTTTTATGATTTTGGTGATTTTGTTGTTAGCTTACCTAATATGGGTGTTCCCTGTTGTACA 

1 ■ I I I 1 I ■ > ■ ■ I I ■ ' i » ^ I 'I 1 ■ ' ■ ■ 13064 

ATTGTAATCTATTGGTTCTAGAATTACCATTGAAAATACTAAAACCACTAAAACAACAATCGAATGGATTATACCCACAAGGGACAACATGT 

LTLONOOLNGNFYDFGOFVVSLPNMGVPCCT 

Replicase 1b 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

17/87 



TCATATTATTCTTATATGATGCCTATTAT6GGTTTAACTAATTGTTTAGCTAGTGAGTGTTTTGTCAAGAGTGATATTTTTGGTA6TGATTT 

I ' ■ ■ I I ■ ■ ■ ■ I ■ ■ ■ ■ I I I I ■ ■ ■ • I ■ . . I I ■ ■ ■ I I 13156 

A6TATAATAAGAATATACTACGGATAATACCCAAATTGATTAACAAATC6ATCACTCACAAAACAGTTCTCACTATAAAAACCATCACTAAA 

SY YSYMMPIMGLTNCLASECFVKSDIFGSOF 

Repllcase lb = 



TAAAACTTTTGATTTGCTTAAGTATGATTTCACTGAACATAAAGAAAATTTATTCAATAAGTACTTTAA6CATT6GAGTTTTGATTATCATC 

■ ■ ■ I I ■ ■ ■ ■ I I ■ ■ • ■ ■ I I I I 1 I ' I I 13248 

ATTTTGAAAACTAAACGAATTCATACTAAAGTGACTTGTATTTCTTTTAAATAAGTTATTCATGAAATTCGTAACCTCAAAACTAATAGTAG 

KTFDLLKYDFTEHKENLFNKYFKHWSFOYH 
^ ^Replicase lb 



CTAATTGTAGT6ACTGTTATGATGATATGTGTGTTATACATTGTGCTAATTTTAATACACTATTTGCCACAACTATACCAGGTACTGCTTTT 

, 1 1 ■ ■ ■ I ■ ■ . ■ M 1 I 1 1 t ' ^ 1 1 — ■ ' M ' . t . I ■ ■ I 13340 

GATTAACATCACTGACAATACTACTATACACACAATATGTAACACGATTAAAATTATGTGATAAACGGTGTTGATATGGTCCATGACGAAAA 

PNCSDCYODMCV.IHCANFNTLFATT IPGTAF 

Replicase lb 



•GGTCCACTATGTCGTAAAGTTTTTATAGATGGTGTTCCACTTGTTACAACTGCTGGTTATCATTTTAAGCAATTAGGTTTGGTTTGGAATAA 

^..^ 1 ■ . t 1 ■ . ■ ■ I i 1 1 ■ I ' ■ ■ ■ t ■ ■ . ■ I 1 H I 1 I ' ■ 13432 

CCAGGTGATACAGCATTTCAAAAATATCTACCACAAGGT6AACAAT6TTGACGACCAATAGTAAAATTC6TTAATCCAAACCAAACCTTATT 

GPLC'RKVF IDGVPLVTTAGYHFKQLGLVWNK 

Replicase Ib^ — 



AGATGTTAACACACACTCAGTTAGGTTGACAATCACTGAACTTTTGCAATTTGTTACTGACCCTTCCTTGATAATAGCTTCTTCTCCAGCAC 

1 I I I I I I . 1 1 1 1 1 M l I ■ ■ I " I 1 i < 1 13624 

TCTACAATTGTGTGTGAGTCAATCCAACTGTTAGTGACTTGAAAACGTTAAACAATGACTGGGAAGGAACTATTATCGAAGAAGAGGTCGTG 

DVNTHSVRLTITELLOFVTOPSLI lASSPA 
— — —Replicase lb ■ — ' ^ 



TCGTTGATCAACGCACTATTTGTTTTTCTGTTGCAGCATTGAGTACTGGTTTGACAAATCAAGTTGTTAAGCCAGGTCATTTTAATGAAGAG 

I I 1 ' ■ I I I i 1 13616 

AGCAACTAGTTGCGTGATAAACAAAAA6ACAACGTCGTAACTCAT6ACCAAACTGTTTAGTTCAACAATTCGGTCCAGTAAAATTACTTCTC 

LVDORT I CFSVAA LSTGLTNQVVKP GHFNEE 

Replicase lb 



TTTTATAACTTTCTTCGTTTAAGAGGTTTCTTTGATGAA6GTTCTGAACTTACATTAAAACATTTCTTCTTCGCACAGAATGGTGAT6CTGC 

■ ■ ■ i i ■ ■ ■ I ■ ■ ■ ■ I I I ■ ' I ' ■ ' I 1 1 I I - I 13708 

AAAATATTGAAAGAA6CAAATTCTCCAAAGAAACTACTTCCAA6ACTTGAAT6TAATTTTGTAAAGAA6AAGCGTGTCTTACCACTACGACG 

fynflrlrgffdegseltlkhff'faqngoaa 

Replicase lb • 



tgttaaagattttgacttttaccgttataataagcctaccattttagatatttgtcaagctagagttacatataagatagtctctcgttatt 

. I . I ■ ■ i I I I I ' I 1 , ■ ■ I I I ■ • ■ , I . ■ \ . I I . I ■ I 13800 

acaatttctaaaactgaaaatggcaatattattcggatggtaaaatctataaacagttcgatctcaatgtatattctatcagagagcaataa 

vkofdfyrynkpt iloicqarvtyk ivsry 

Replicase lb 



ttgacatttatgaaggtggctgtattaaggcatgtgaagttgttgtaacaaatcttaataagagtgctggttggccattaaataagtttggt 

I i . ■ ■ I I III I I I I I 1 ' > 13892 

aactgtaaatacttccaccgacataattccgtacacttcaacaacattgtttagaattattctcacgaccaaccggtaatttattcaaacca 

foiyeggcikacevvvtnlnksagwplnkfg 
— — ^— — ^— — ^— — — Replicase 1 b — -— ^— — — — — 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

18/87 



AAAGCTAGTTTGTATTACGAATCTATATCTTATGAAGAACAGGATGCTTTGTTTGCTTTGACAAAGCGTAATGTCCTCCCTACTATGACACA 

' I ' 1 • — I I ' ' ' I \ i ■ ■ ■ I I ■ ■ ■ ■ I ■ ■ ■ I 1398^ 

TTTCGATCAAACATAATGCTTAGATATAGAATACTTCTTGTCCTACGAAACAAACGAAACTGTTTCGCATTACAG6AG6GATGATACTGTGT 

KA SLYYESISYEEQOALFALTKRNVLPTMTO 
—————— ————— — Replicase lb 



GCTGAATCTTAAGTATGCTATTAGTGGTAAAGAACGTGCTAGAACTGTTGGTGGTGTTTCTCTGTTGTCCACAATGACCACAAGACAATACC 

I ' ■ 1 I I ' ' ' I ■ I 1 I ' I 1 1 ' ■ I ' 11 I I I 14076 

C6ACTTAGAATTCATACGATAATCACCATTTCTTGCACGATCTT6ACAACCACCACAAAGAGACAACAGGT6TTACTGGTGTTCTGTTATGG 

LNLKYAISGKERARTV6GVSLLSTMTTR0Y 
Replicase lb 



ATCAAAAACATCTTAAATCCATT6TTAATACACGCAATGCCACT6TTGTTATTGGTACTACCAAAT*TTTATGGT66TTGGAATAATAT6TTG 

■ ■ ■ I t I I ■ ' I I I I ■ r I 14168 

TAGTTTTTGTAGAATTTAGGTAACAATTATGTGCGTTACGGTGACAACAATAACCATGATGGTTTAAAATACCACCAACCTTATTATACAAC 

HQKHLKS I VNTRNATVV i GTTKFYGGWNNML 
^ —Replicase lb 



CGTACTTTAATTGATGGTGTT6AAAACCCTAT6CTCATG6GTTG6GATTATCCCAAATGTGATAGAGCTTT6CCTAACATGATAC6TAT6AT 

I I I • I I ■ I ' ' ■ I I ■ I ■ ■ I I I , ■ I ■ ■ ■ i 14260 

GCATGAAATTAACTACCACAACTTTTG6GATACGAGTACCCAACCCTAATA6GGTTTACACTATCTCGAAACG6ATTGTACTATGCATACTA 

RTL I 0G.VENPHLM6WDYPKCDRALPNM I RM I 
^ — — —Replicase lb 



TTCAGCCATGGTGTTGGGTTCTAAGCATGTTAATTGTTGTACTGTAACAGATAGGTTTTATAGGCTTGGTAACGAGTTGGCACAAGTTTTAA 

I I 1 -I I 1 1 1 1 " I ■ I ■ ■ : — » ■ ■ ■ ■ I 14352 

AAGTC'GGTACCACAACCCAAGATTCGTACAATTAACAACATGACATTGTCTATCCAAAATATCCGAACCATTGCTCAACCGTGTTCAAAATT 

SAMVLGSKHVNCCTVT-DRFYRLGNELAOVL 
■ Replicase 1b 



CAGAAGTTGTTTATTCTAATGGTGGTTTTTATTTTAAGCCAGGTGGTACGACTTCTGGTGACGCTAGTACAGCTTATGCTAATTCTATTTTT 
I I ■ I I I ■ I I . ■ ■ ■ I ■ . . ■ I I ■ ■ ■ ■ t ■ ■ . , I I I ■ ■ I I ■ ■ . ■ 14444 

GTCTTCAACAAATAAGATTACCACCAAAAATAAAATTCGGTCCACCATGCTGAAGACCACTGCGATCATGTCGAATACGATTAAGATAAAAA 

TEVVYSNGGFYFKPGGTTS6 DASTAYANS I F 

Replicase lb 



AACATTTTTCAAGCCGTGA6TTCTAACATTAACA6GTTGCTTAGTGTCCCATCAGATTCATGTAATAATGTTAATGTTAGGGATCTACAACG 

I I I ■ I I I 1 I I I ■ ■ ■ . I . ■ ■ ■ I ^ 14536 

TTGTAAAAAGTTC66CACTCAAGATTGTAATT6TCCAACGAATCACAG6GTAGTCTAAGTACATTATTACAATTACAATCCCTAGAT6TTGC 

N I FOAVSSN I NRLLSVPS05CNNVNVR0LQR 
: • — ■ Replicase 1 b ■ 



ACGTCTGTATGATAATTGCTATAGGTTAACTAGTGTTGAAGAGTCATTCATTGATGATTATTATGGTTATCTTAGGAAACATTTTTCAATGA 

■ I ' I I 1 I t ■ I ' 'I I I I 14628 

TGCAGACATACTATTAACGATATCCAATTGATCACAACTTCTCA6TAAGTAACTACTAATAATACCAATAGAATCCTTTGTAAAAA&TTACT 

RLYDNCYRLTSVEESF lODYYGYLRKHFSh 

Replicase lb — - 



TGATTCTCTCTGATGACGGTGTTGTCTGTTATAACAAGGATTATGCTGAGTTAGGTTATATA6CA6ACATTA6TGCTTTTAAAGCCACTTTG 

* I ■ . I I I I ■ I I I I i i ' ■ ■ ■ I I i 14720 

ACTAAGAGA6ACTACTGCCACAACAGACAATATTGTTCCTAATACGACTCAATCCAATATATCGTCTGTAATCACGAAAATTTC66TGAAAC 

MILS006VVCYNKOYAELGY IAD ISAFKATL 

Replicase lb 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

19/87 



TATTACCAGAATAATGTCTTTATGAGTACTTCTAAATGTTGGGTTGAAGAAGATTTAACTAAGGGACCACATGAGTTTTGTTCCCAGCATAC 

I ■ I I I I 1 I ' ■ I ' 1 ■ I I ' ■ 1^*812 

ATAATGGTCTTATTACAGAAATACTCATGAAGATTTACAACCCAACTTCTTCTAAATTGATTCCCTGGTGTACTCAAAACAAGGGTCGTATG 

YY'O NNVFMSTSKCW VEEDLTKGPHEFCSQHT 

Replicase lb 



TAT6CAAATAGTTGATAAAGAT6GTACCTATTATTT6CCTTACCCAGATCCTA6TAGGATCTTGTCAGCT66TGTTTTTGTTGATGATGTTG 

— I I ■ I ■ ■ ■ ■ ■ I ■ ■ ■ ■ ■ I I I I n ■ ■ I ■ ■ ■ 1490a 

ATACGTTTATCAACTATTTCTACCAT6GATAATAAACGGAAT66GTCTAGGATCATCCTAGAACAGTCGACCACAAAAACAACTACTACAAC 

MOIVOKDGTYYLPYPOPSRILSAGVFVOOV 
—Replicase lb 



TTAAGACAGATGCTGTTGTTTTGtTAKAACGTTATGTGTCTTTAGCTATTGATGCATACCCTCTTTCAAAACACCCTAATTCTGAATATCGT 

H i ■ ■ . i I I , , ■ . • 1 ■ ■ 1 I ■ I i I 14996 

AATTCTGTCTACGACAACAAAACAATMTTGCAATACACAGAAATCGATAACTACGTATGGGAGAAAGTTTTGTGGGATTAAGACTTATAGCA 

VKTDAVVLL7RYVSLAI DAYPLSKHPNSEYR 

Replicase lb 



AAGGTTTTTTACGTATTACTTGATTGGGTTAAGCATCTTAACAAAAATTTGAATGAGGGTGTTCTTGAATCTTTTTCTGTTACACTTCTTGA 

■ ■ ■ I . I t I I • 1 ■ ' I ■ I ■ ■ I I ■ ■ ' I 15088 

TTCCAAAAAATGCATAATGAACTAACCCAATTCGTA6AATTGTTTTTAAACTTACTCCCACAAGAACTTAGAAAAAGACAAT6TGAAGAACT 

KVFYVLLDWVKHLNKNLNEGVLESFSVTLLO 

• RepHcase 1b ^ 



TAATCAAGAAGATAAGTTTTGGTGTGAAGATTTTTATGCTAGTATGTATGAAAATTCTACAATATTGCAAGCTGCTGGCTTATGTGTTGTTT 

■ I ■ ■ ■ I ■ . i ■ ■ I I 1 1 . 1 I ■ ■ i ■ ■ ■ ^ I ; ■ , ■ , , ■ ■ , , I 15180 

ATTAGTTCTTCTATTCAAAACCACACTTCTAAAAATACGATCATACATACTTTTAAGATGTTATAACGTTCGACGACCGAATACACAACAAA 

NOEDKFWCEDFYASMY ENST I IQAAGLCVV 
— Replicase 1 b ■ . 



GTGGTTCACAAACTGTTCTTC6TTGTGGTGATTGTCTGCGTAA6CCTATGTTGT6CACTAAATGTGCATATGATCATGTATTT6GTACCGAC 

I ^ I 1 ■ ■ ■ ■ I ■ ■ ■ i ■ ■ I I i i I 15272 

CACCAAGTGTTTGACAA6AAGCAACACCACTAACAGACGCATTCGGATACAACACGTGATTTACACGTATACTAGTACATAAACCATGGCTG 

CGSQTVLRCGOCLRKPHL CTKCAYDHVFGTD 
— — — — — ^ — Replicase 1 b 



CACAAGTTTATTTTGGCTATAACACCGTATGTATGTAATGCATCAG6TT6TGGT6TTAGTGATGTTAAAAAATTGTATCTTGGTGGTTTGAA 
I I ■ . ■ , I ■ . . ■ I ■ ■ I I I I ■ ■ ■ . I ■ . . ■ I i I . . ■ ■ 153611 

6TGTTCAAATAAAACCGATATTGTGGCATACATACATTACGTAGTCCAACACCACAATCACTACAATTTTTTAACATAGAACCACCAAACTT 

HKF ILA I TPYVCNASGCGVSDVKKLYLGGLN 

^ ^ Replicase 1b 



TTACTATTGTACAAATCATAAACCACAGTTGTCTTTTCCATTATGTTCTGCTGGTAATATATTTGGTTTATATAAAAATTCAGCAACTGGTT 

H 1 , 1 I I 1 1 H I ^ 15456 

AATGATAACATGTTTAGTATTTGGTGTCAACAGAAAAGGTAATACAAGACGACCATTATATAAACCAAATATATTTTTAAGTCGTTGACCAA 

YYCTNHKPQLSFPLCSAGNIFGLYKNSATG 
—Replicase lb 



CCTTAGAT6TTGAAGTTTTTAATA6GCTTGCAACGTCT6ATTGGACTGATGTTAGGGACTATAAACTTGCTAATGATGTTAAAGATACACTT 

1 I I ■ . t ■ I \ 1 1 » ■ " I ' ■ I 1 16548 

6GAATCTACAACTTCAAAAATTATCCGAACGTTGCAGACTAACCTGACTACAATCCCT6ATATTTGAACGATTACTACAATTTCTAT6TGAA 

SLDVEVFNRLATSDWTOVRDYKLANDVKD T L 

- -Replicase 1b^ • 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

20/87 



AGACTCTTTGCGGCTGAAACTATTAAAGCTAAAGAAGAGAGTGTTAAGTCTTCTTATGCTTTTGCAACTCTTAAAGAGGTTGTTGGACCTAA 

' » 1 I I I ' - I ■ ' ) 'I I I I ... I ... I I , I i56tlO 

TCTGAGAAACGCCGACTTTGATAATTTCGATTTCTTCTCTCACAATTCAGAAGAATAC6AAAAC6TTGAGAATTTCTCCAACAACCT6GATT 

RL FAAE T IKAKEESVKSSYAFATLKEVVGPK 

Replicase 1b 



AGAATTGCTTCTTAGTTGGGAAAGTGGTAAAGTTAAACCACCTTTGAATCGTAATTCTGTTTTCACCTGTTTTCAAATAAGTAAGGACTCAA 

■ ' I I i 1 ' ■ I I I ' ' I I ' » 1 15732 

TCTTAACGAAGAATCAACCCTTTCACCATTTCAATTTGGTGGAAACTTAGCATTAAGACAAAAGTGGACAAAAGTTTATTCATTCCTGAGTT 

ELLLSWESGKVKPPLNRNSVFTCF Q I SKDS 
-— Replicase 1 b —————— ^— 



AATTCCAAATAGGTGAGTTCATCtTTGAAAAGGTTGAATATGGTTCTGATACTGTTACGTATAAGTCTACTGTAACCACTAAGTTAGTTCCT 

I ■ ■ ■ i 1 1 ■ I ■ I ' ■ ■ I ■ ■ ■ I I I I ■ • I 15824 

TTAAGGTTTATCCACTCAAGTAGAAACTTTTCCAACTTATACCAA6ACTATGACAATGCATATTCAGATGACATTGGTGATTCAATCAA66A 

KFO I GEFIFEKVEYGSDTVTYKSTVTTKLVP 
— ; Replicase 1 b — —————— ——— 



GGTATGATTTTTGTCTTAACATCTCACAATGTTCAACCTTTACGTGCACCAACTATTGCAAACCAAGAGAAGTATTCTAGCATTTATAAATT 

I ■ I t I i ' I ' I I ■ ■ ■ ■ I : ' I ■ ' ■ ■ I ' ' ' ' I 15916 

CCATACTAAAAACA6AATTGTAGAGTGTTACAAGTTGGAAAT6CACGTGGTTGATAACGTTTGGTTCTCTTCATAAGATCGTAAATATTTAA 

GMIFVLTSHNVOPURAPTIANQEKYSSIYKL 

' Replicase lb 



GCACCCTGCTTTTAATGTCAGTGATGCATATGCTAATTTGGTTCCATATTACCAACTTATTGGTAAACAAAAGATAACTACAATACAGGGTC 

1 I : . . I I I I ■ ■ I 1 ■ ■ ■ i ■ ■ ■ ■ I ■ ■ I » ■ ■ ■ I 16008 

CGTGGGAC6AAAATTACAGTCACTACGTATACGATTAAACCAAGGTATAATGGTTGAATAACCATTTGTTTTCTATTGATGTTATGTCCCAG 

HPAFNVSDAYANLVP YYOL I G KQK I TT I QG 

Replicase lb 



CTCCTGGTAGTGGTAAGTCACATTGTTCCATTGGACTTGGATTGTACTATCCAGGTGCGCGTATTGTTTTTGTTGCTTGTGCCCATGCTGCT 

■ I I ■ ■ ■ ■ t 1 i I I ' t . ■ : I I I ■ . , ■ I ■ ■ ■ ■ I I 16100 

GAGGACCATCACCATTCAGTGTAACAAGGTAACCTGAACCTAACATGATAGGTCCACGCGCATAACAAAAACAACGAACACGGGTACGACGA 

PPGSGKSHCSIGLGLYYPGARIVFVACAHAA 

Replicase 1b 



GTTGATTCCTTATGTGCAAAAGCTATGACTGTTTATAGCATTGATAAGTGTACTAGGATTATACCTGCAAGAGCTCGGGTTGAGTGTTATAG 

^-•^H 1 I ^ - I - I I I I - I -f— 16192 

CAACTAAGGAATACACGTTTTCGATACTGACAAATATCGTAACTATTCACATGATCCTAATATGGACGTTCTCGAGCCCAACTCACAATATC 

VDSLCAKAMTVYSIDKCTRI IPARARVECYS 

Replicase lb 



TGGCTTTAAACCAAATAACACTAGT6CACAATACATATTTAGCACTGTTAAC6CATTACCTGAGTGTAATGCTGATATTGTTGTT6TAGATG 

I I ■ ■ I I I I ■ ■ ' ■ I I 1 ■ ■ ' ■ 16284 

ACCGAAATTTGGTTTATT6TGATCACGTGTTATGTATAAATCGTGACAATTGC6TAAT6GACTCACATTACGACTATAACAACAACATCTAC 

GFKP NNTSAQY I FSTVNALPECNAD IVVVO 

Replicase lb 



AAGTTTCAATGTGTACAAATTATGACCTTTCTGTTATTAATCAGCGTTTATCATATAAACATATTGTTTATGTTGGTGATCCACAACAACTT 

■!■> ■ I I I I i I ■ ■ I I ■ I I : I I ■ I ■ ■ I ■ ■ i ■ 16376 

TTCAAAGTTACACATGTTTAATACTGGAAAGACAATAATTA6TCGCAAATA6TATATTT6TATAACAAATACAACCACTAGGTGTTGTTGAA 

EVSMC TNYDLSV I NQRLSYKH i VYVGOPQO L 
' Replicase 1 b ' 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/00080S 

21/87 



CCTGCACCTAGAGTAATGATTACTAAAGGTGTTATGGAGCCTGTTGATTATAACGTTGTTACTCAACGTATGTGTGCTATAGGCCCTGATGT 

" » » ' ■ ' ■ ■ I ' I " I I i ■ I ■ t . I ^, ■ ■ ■ i6<t68 

GGACGT6GATCTCATTACTAATGATTTCCACAATACCTCGGACAACTAATATTGCAACAATGAGTT6CATACACACGATATCCGG6ACTACA 

PA PRVMlTKGVhEPVOYNVVTORhCAIGPDV 
— — Replicase lb 



TTTTCTTCATAAATGTTATAGAT6TCCT6CTGAAATA6TTAATACA6TTTCT6AACTTGTTTATGAGAACAAGTTTGTCCCTGTTAAACCTG 

' ' I ■ ■ ■ ■ I . . ■ ■ I I I I I I ■ I ■ i I I I I 16560 

AAAAGAAGTATTTACAATATCTACAGGACGACTTTATCAATTAT6TCAAA6ACTTGAACAAATACTCTTGTTCAAACAGG6ACAATTTGGAC 

FLHKCYRCPAEIVNTVSELVYENKFVPVKP 
— Replicase 1b . 



CTAGTAAACAGTGTTTTAAAATCTTTTTTAAGGGTAATGTACAGGTTGACAATGGCTCTAGTATTAACAGAAAGCAGCTTGAAATAGTTAAG 

r ■ ■ ' ' I I : I ■ : ■ I I ■ ■ ■ I I I I 1 I I ■ I I ■ ' 16652 

GATCATTTGTCACAAAATTTTAGAAAAAATTCCCATTACATGTCCAACTGTTACCGAGATCATAATTGTCTTTCGTCGAACTTTATCAATTC 

ASKQCFKIFFKGNVOVDNGSSINRKOLEIVK 
— Replicase lb • — 



CTGTTTTTAGTTAAAAATCCAAGTTGGAGTAAGGCTGTGTTTATTTCTCCTTATAATAGTCAGAATTATGTTGCTAGTAGATTTTTAGGACT 
■ ' ' ' I ' ' ' I i ■ ■ ■ I 1 ■ ■ ■ . I . ■ ■ . I ■ ■ I I I I , , 1 , ... I ... , ^6744 

6ACAAAAATCAATTTTTAGGTTCAACCTCATTCCGACACAAATAAAGAGGAATATTATCA6TCTTAATACAACGATCATCTAAAAATCCTGA' 

LFLVKNPSWSKAVFISPYNSQNYVASRFLGL 

' r- Replicase lb 



TCAAATTCAAACTGTTGATTCTTCTCAAGGTAGTGAGTATGATTATGTAATCTATGCACAAACTTCTGACACTGCACATGCTTGCAATGTAA 

' ■ ■ ■ ■ I I I I . . . ■ I ■ ■ ■ . I j I I — f ' ■ ■ I ■ — --f— 16836 

AGTTTAAGTTTGACAACTAAGAAGAGTTCCATCACTCATACTAATACATTAGATACGTGTTTGAAGACTGTGACGTGTACGAACGTTACATT 

QIOTVOSSQGSEYDYVIYAOTSDTAHACNV 

Replicate 1 b ' 



ACCGTTTTAAT6TTGCTATAACACGTGCTAAGAAGGGTATATTTTGTGTAATGTGTGATAAAACTTTGTTTGATTCACTTAAGTTTTTTGAG 

■ ' ■ I I ' ■ ' ■ I I I 1 I . I ■ ■ ■ I ■ ■ . ■ I ■ I I 16928 

TGGCAAAATTACAACGATATTGTGCACGATTCTTCCCATATAAAACACATTACACACTATTTT6AAACAAACTAAGTGAATTCAAAAAACTC 

NRFNVAITRAKKGIFCVMCDKTLFDSLKFFE • 
— :-Repllcase Ib^ 



ATTAAACATGCAGATTTACACTCTAGCCAGGTTTGTGGCTTGTTTAAAAATTGTACACGCACTCCTCTTAATTTACCACCAACTCATGtACA 

' I I I I ■ ■ ■ ■ I ■ ■ ■ ■ I ■ . I I ■ ■ ■ I • ■ . . , I , , I I I t7020 

TAATTTGTACGTCTAAAT6TGAGATC6GTCCAAACACCGAACAAATTTTTAACAT6TGCGT6A6GA6AATTAAATGGT66TTGA6TACGTGT 

IKHAOLHSSQVCGLFKNCT RTPLNLPPTHAH 
Replicase lb — 



CACTTTCTTGTCGTTGTCAGATCAGTTTAAGACTACAGGTGATTTAGCTGTTCAAATAGGTTCAAATAATGTTTGTACTTATGAACATGTTA 

1 1 1 ^ I ■ ' 1 1 1 1 1— 17112 

GTGAAAGAACAGCAACAGTCTAGTCAAATTCTGATGTCCACTAAATCGACAAGTTTATCCAAGTTTATTACAAACATGAATACTTGTACAAT 

TFLSLSDQFKTTGOLAVOIGSNNVCTYEHV 

Replicase lb 



TATCATTTATGGGTTTTAGGTTTGATATTAGTATTCCTG6TA6TCATAGTTTGTTTTGTACACGTGACTTTGCTATTCGTAATGTGCGTGGT 

' I » I I " I 1 ' ( . ' ■ I 1 M ' ' ' I I I I I 17204 

ATAGTAAATACCCAAAATCCAAACTATAATCATAAGGACCATCA6TATCAAACAAAACATGT6CACTGAAACGATAAGCATTACACGCACCA 

ISFhGFRFDI S IPGSHSLFCTRDFA IRNVRG 
— — ' — Replicase 1b ____ 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

22/87 



T6GTTGGGTATGGATGTTGAAAGTGCTCAT6TTTGTGGCGATAACATAGGTACTAATGTTCCTTTACAGGTTGGTTTTTCAAATGGTGTTAA 

I ■ ■ ■ I ■ I 1 I 1 ■ I ■ ■ --H 1 I ' I ' ' I I I ' I - I ^ 17298 

ACCAACCCATACCTACAACTTTCACGAGTACAAACACCGCTATT6TATCCATGATTACAAGGAAATGTCCAACCAAAAAGTTTACCACAATT 

WLGMDVESAHVCGON I GTNVPL0 .VGFSN6VN 
' Replicase lb- 



TTTTGTTGTGCAAACTGAAGGTTGTGTGTCTACCAATTTTGGTGATGTTATTAAACCTGTTTGTGCAAAATCTCCACCAGGTGAACAATTTA 

■ ■ ■ I I I I ■ I ■ I . . ■ ■ I I I ■ ■ ■ I 1 I I 17388 

AAAACAACACGTTTGACTTCCAACACACAGATGGTTAAAACCACTACAATAATTTGGACAAACACGTTTTAGA6GTGGTCCACTTGTTAAAT 

FVVOTEGCVSTNFGOVIKPVCAKSPPGEQF 
_ — Replicase lb- 



GACACCTTGTTCCTTTTTTACGTAAAGGACAACCTTGGTTAATTGTTCGTAGACGCATTGTGCAAATGATATCTGATTATTTGTCCAATTTG 

I I I I I I I I I I 17*180 

CTGTGGAACAAGGAAAAAATGCATTTCCTGTTGGAACCAATTAACAAGCATCTGCGTAACACGTTTACTATAGACTAATAAACAGGTTAAAC 

RHLVPFLRK GQPWLIVRRRIVOHISDYLSNL 
— — Replicase lb 



TCTGACATTCTTGTCTTTGTTTTGTGGGCAGGTAGTTTGGAATTAACTACAATGCGTTACTTTGTAAAAATA6G6CCAATTAAATATTGTTA 

, 1 , ■ ■ I : I i ■ I I 1 i I I ■ ■ ■ ■ I 1 1 17572 

AGACTGTAA6AACAGAAACAAAACACCCGTCCATCAAACCTTAATTGATGTTACGCAATGAAACATTTTTATCCCGGTTAATTTATAACAAT 

SOILVFVLWAGSLELTThRYFVKIGPIKYCY 

Replicase 1b 



TTGTGGTAATTCTGCCACTTGTTATAATTCAGTTAGTAATGAATATTGTTGTTTTAAACATGCATTGGGTTGTGATTATGTTTACAATCCGT 

I , ■ i ■ ■ I ■ ■ . ■ I I 1— 1 I I ' I ' I ' » ■ ■ ■ ? I ■ ■ V ■ i I ■ ■ ■ ■ 17664 

AACACCATTAAGACGGTGAACAATATTAAGTCAATCATTACTTATAACAACAAAATTTGtACGTAACCCAACACTAATACAAATGTTAGGCA 

C6NSATCYNSVSNEYCCF KHAL6CDYVYNP 

Replicase lb 



ATGCTTTTGATATACAACA6TG6G6TTATGTTG6TTCCTT6AGCCAGAACCACCACACGTTCTGTAACATTCATAGAAACGAGCATGATGCT 

I I ■ I ■ I I ■ ■ ■ I I I I I I I I ■ ■ I I I 17756 

TACGAAAACTATAT6TTGTCACCCCAATACAACCAAGGAACTCG6TCTTGGTGGT6TGCAAGACATT6TAAGTATCTTTGCTC6TACTACGA 

YAFO I QOWGY. VGS-LSQNHHTFCN I H.RNCHDA 
■ ^RepHcase lb 



TCTGGTGATGCTGTTATGACACGTTGTTTGGCAGTACATGATTGTTTTGTCAAAAATGTTGATTGGACTGTAACGTACCCCT.TTATTGCAAA 

I ■ I ■ I \ I I I I I I 17848 

AGACCACTACGACAATACTGTGCAACAAACCGTCATGTACTAACAAAACAGTTTTTACAACTAACCTGACATTGCATGG6GAAATAACGTTT 

SGDAVMTRCLAVHDCFVKNVDWTVTYPF I AN 
— Replicase 1 b ' 



TGAGAAATTTATCAATGGCTGTGGGCGTAATGTCCAGGGACATGTTGTTCGCGCAGCCTTGAAATTGTATAAACCTAGTGTTATTCATGATA 

^ , 1 1 1 1 1 1 I i I I ■ : . I I I ■ ' 1 1 1 h 17940 

ACTCTTTAAATAGTTACCGACACCCGCATTACAGGTCCCTGTACAACAAGCGCGTCGGAACTTTAACATATTTGGATCACAATAAGTACTAT 

EKF I NGCGRNVQGHVVRAA LKLY KPSV I HO 
— Replicase 1b— — — 



TTGGTAATCCTAAAGGTGTACGTTGTGCTGTTACTGATGCCAAATGGTACTGTTATGACAAGCAACCTGTTAATAGTAATGTCAAGTTGTTG 

■ . ■ I I ■ ■ . I I I I I I I ' ■ < 1 » I I I — ■ ■ I ' ■ 18032 

AACCATTAGGATTTCCACATGCAACAC6ACAATGACTACGGTTTACCATGACAATACTGTTC6TT6GACAATTATCATTACAGTTCAACAAC 



IGNPKGVRCAVTDAKWYCYDK0PVN5NVK L L 

Replicase lb ' 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



23/87 



PCT/NL2004/000805 



GATTATGATTATGCAACCCATGGTCAACTTGATGGTCTTTGTTTATTCTGGAATTGTAATGTTGATATGTATCCAGAATTTTCAATTGTGT6 
' ' I ■ ■ ■ I I I I I I I I t . i ■ ■ I I ■ ■ I , ■ ■ I I , . ■ I ■ . , , 18124 

CTAATACTAATACGTTGGGTACCAGTTGAACTACCAGAAACAAATAAGACCTTAACATTACAACTATACATAG6TCTTAAAA6TTAACACAC 

0Y0YATHGQLD6LC LFWNCNVDMYPEFSIVC 
' ' • Replicasd lb 

TCGCTTTGACACAC6TACTCGTTCTGTTTTTAATTTAGAAGGT6TTAATG6TGGTTCTCTTTATGTTAACAAACATGCGTTTCATACACCAG 

' H I ' I I I ' " ' I ' ' I 18216 

AGCGAAACTGTGTGCATGAGCAAGACAAAAATTAAATCTTCCACAATTACCACCAAGAGAAATACAATTGTTTGTAC6CAAAGTAT6TGGTC 

RFDTRTRSVFNLEGVNGGSLYVNKHAFHTP 
— Repllcase 1b 



CATATGATAAACGTGCTTTTGTTAAATTAAAACCTATGCCCTTTTTTTACTTTGATGACAGTGATTGTGATGTTGTGCAAGAACAAGTTAAT 

■ " ' I ■ ■ I . I I I I ■ ■ I I I .... I .... I I I 18308 

GTATACTATTTGCACGAAAACAATTTAATTTTGGATACGGGAAAAAAATGAAACTACTGTCACTAACACTACAACACGTTCTTGTTCAATTA 

AYOKRAFVKLKPMPFFYFODSDCDVVQEQVN 
— ^— — ^— — Replicase 1 b 



TATGTACCCCTTCGCGCTAGTAGTTGTGTTACCCGTTGTAATATAGGTGGTGCTGTTTGTTCAAAACATGCAAATTTGTATCAAAAATATGT 

■ I t I .... > ...■ i 1 I I ' ' ' I t i 18400 

ATACATGGG6AA6CGC6ATCATCAACACAATG66CAACATTATATCCACCACGACAAACAAGTTTTGTACGTTTAAACATAGTTTTTATACA 

YVPLRASSCVTRCNIGGAVCSKHANLYOKYV 
Replicase 1b 



TGAGGCATATAATACATTTACACAGGCTGGTTTTAACATTTG6GTACCACATAGTTTTGATGTTTATAATTTGTGGCAAATTTTTATTGAAA 

> 'l I I I I I ■ ■ I i - i ■ • ■ . ■ I I 1- I , , 18492 

ACTCCGTATATTATGTAAATGTGTCCGACCAAAATTGTAAACCCATGGTGTATCAAAACTACAAATATTAAACACCGTTTAAAAATAACTTT 

EAYNTFT0A6FNIWVPHSFDVYNLW0IF IE 

Replicasa 1 b ' # 'i — ' 



CTAATTTACAAAGTCTTGAAAATATA6CATTTAATGTT6TAAAAAAAGGGT6TTTTACTGGT6TTGATGGTGAGTTACCTGTTGCA6TTGTT 

i 1 I I I ■ ' ■ ■ ■ ' I I t ill. 18584 

GATTAAATGTTTCAGAACTTTTATATCGTAAATTACAACATTTTTTTCCCACAAAATGACCACAACTACCACTCAATGGACAACGTCAACAA 

TNLOSLEN l AFNVVKKGCFT6V DGELPVAVV 

Replicase lb— — _ . — „ — _ 



AAC6ACAAAGTTTTTGTTC6CTATGGCGATGTTGACAACTTGGTTTTTACAAATAAAACAACATTGCCTACTAAT6TTGCTTTTGAATT6TT 

' I I I ■ ■ 1 ■ I I I I I I . . . . I , , , , 1 18676 

TTGCTGTTTCAAAAACAAGCGATACCGCTACAACTGTTGAACCAAAAAT6TTTATTTT6TTGTAACG6ATGATTACAACGAAAACTTAACAA 

NOKVFVRYGDVONLVFTNKTTLPTNVAFELF 
— Replicase lb 



TGCAAAACGAAAAATGGGTTTAACACCACCATTGTCTATTCTCAAAAATCTTGGTGTTGTTGCTACATATAAATTTGTTTTATGGGATTATG 

■ ■ ■ I I ■ ■ I I I I I ■ .1 I I I I I I 18768 

AC6TTTTGCTTTTTACCCAAATTGTGGTGGTAACAGATAA6AGTTTTTAGAACCACAACAACGATGTATATTTAAACAAAATACCCTAATAC 

AKRKMGLTPPLS ILKNLGVVATYKFVLWDY 

Replicase lb 



AAGCT6AAAGACCTTTTACCTCATATACTAA6AGTGTATGTAAATACACTGATTTTAATGAGGATGTTTGTGTTTGTTTTGACAATAGTATT 

■ i i I 1 I 1 I ' I I I ■ I I I I I I I I 18860 

TTCGACTTTCTGGAAAATG6AGTATATGATTCTCACATACATTTATGT6ACTAAAATTACTCCTACAAACACAAACAAAACTGTTATCATAA 

EAERPFTSYTKSVCKYTOFNEOVCVCFDNS I 
■ — __— Replicase 1 b- — 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

24/87 



CAGGGTTCGTATGAGCGTTTTACGCTTACTACGAACGCTGTTTTATTTTCTACTGTTGTCATTAAAAATTTAACACCTATAAAGTTGAATTT 

I ■ ■ ■ ■ I ■ ■ ■ ■ I ■ , I ■ I I I I ■ . ■ I I I .1 , I , I I , , , i I 18952 

GTCCCAAGCATACTCGCAAAATGCGAATGATGCTTGCGACAAAATAAAAGATGACAACAGTAATTTTTAAATTGTGGATATTTCAACTTAAA 

OGSYERFTLTTNAVLFSTVVIKNLTPIKLNF 
■ — Replicase Ib^ — — 



tggtatgttgaatggtatgccAgtttcttctattaagagtgataaaggtgttgaaaaattagttaattggtacacatatgttcgtaaaaatg 

■ " I ■ I I ■ ■ I ■ ■ ■ ■ I ■ ■ ■ I I I ' I I I ■ ■ I I I ■ t : : I . . t I : 19044 

accatacaacttaccatacggtcaaagaagataattctcactatttccacaactttttaatcaattaaccatgtgtatacaagcatttttac 

gmlngmpvss i ksokgveklvnwytyvrkn 
Replicase 1b — 



gtcaatttcaagatcattatgatggtttttacactcaaggtaggaatttatcagactttacaccaagaagtgatatggagtatgattttctt 

■ ' ■ ' ' I I ■ ' ■ ■ I 1 I 1 I 1 I 1— 19136 

cagttaaagttctagtaatactaccaaaaatgtgagttccatccttaaatagtctgaaatgtggttcttcactatacctcatactaaaagaa 
gqfqdhydgfytogrnlsdftprsdmeydfl 

^ Replic^e 1 b ' 



aacatggatatgggtgtttttattaataaatatggtcttgaggattttaattttgaacatgttgtatatggtgatgtttcaaaaactacatt 

■ ' ■ I ' ■ ■ ■ I ■ . ■ ■ I . ■ — 1 1 1 I I ... I 1 ■ i ■ ■ ■ ■ I I I i 19228 

TTGTACCTATACCCACAAAAATAATTATTTATACCAtSAACTCCTAAAATTAAAACTTGTACAACATATACCACTACAAAGTTTTTGATGTAA 

NMDMGVF I NKYGLEOFNFEHVVYGDVSKTTL 

1 Replicase 1b 



AGGAGGTCTTCATTTGTTGATATCACAGTTTAGGCTTAGTAAAATGGGTGTTTTGAAAGCTGATGATTTTGTCACTGCTTCTGACACAACTT 

■I I I I I 1 'I I I i» ' ' ■ I ■ ■ ■ ■ I — * ■ I I 19320 

TCCTCCAGAAGTAAACAACTATAGTGTCAAATCCGAATCATTTTACCCACAAAACTTTCGACTACTAAAACAGTGACGAAGACTGTGTTGAA 

GGLHLL I SQFRLSKMGVL KADOFVTASOTT 

RepKcase lb 



TGAG6TGCTGTACTGTTACTTATCTTAATGAACTTAGTTCAAAAGTTGTTTGTACTTATATGGATTTGTT6TTGGACGACTTTGTTACTATA 

■ ■ I ■ ■ ■ « I ■ ■ ■ ■ I I I I ■ ■ ■ ■ I ■ ■ ■ ■ I ■ , , I I I I I I , . 19(112 

ACTCCACGACATGACAATGAATAGAATTACTTGAATCAAGTTTTCAACAAACATGAATATACCTAAACAACAACCTGCTGAAACAATGATAT 

LRC CTVTYLNELSSKVVCTY MD L LL DDFVT I 
'• Replicase lb 



CTAAA6A6TTTAGATCTTGGTGTAATATCTAAAGTTCATGAAGTTATTATAGATAATAAACCTTATAG6TGGATGTTGT6GTGTAAAGATAA 

1 I 1 I ■ I i . ■ ' 1 ■ I ' ■ 1 1 I 19504 

GATTTCTCAAATCTAGAACCACATTATAGATTTCAAGTACTTCAATAATATCTATTATTTGGAATATCCACCTACAACACCACATTTCTATT 

LKSLDL GViSKVHEVI IDNKPYRWMLWCKDN 
= Replicase lb 1 '• 



CCACTTGTCGACTTTTTATCCACAGTTGCAGTCTGCTGAATGGAAGTGTGGTTATGCTATGCCACAAATTTATAAGCTTCAACGTATGTGTT 

I I I 1 I I I I I ' ■ ■ ■ i 19596 

GGTGAACAGCTGAAAAATAGGTGTCAACGTCAGACGACTTACCTTCACACCAATACGATACGGTGTTTAAATATTC6AAGTTGCATACACAA 

HLSTFYPOLOSAEWKCGYAMPQI YKLORMC 

Replicase lb 



TGGAACCTTGTAATTTATATAATTATGGTGCTGGTATTAAGTTGCCTAGT6GTATAATGTTAAATGTT6TTAAATACACTCAGCTTTGTCAA 

1 I ■ ■ ■ ' t I ■ I I I I I I I I I ■ I I ' " 19688 

ACCTTGGAACATTAAATATAT.TAATACCACGACCATAATTCAACGGATCACCATATTACAATTTACAACAATTTATGT6AGTCGAAACA6TT 

LEPCNLYNYGAGIK.LPSGIMLNVVKYTOLCQ 

Replicase lb 
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TACCTAAATAGCACTACAATGTGC6TACCTCATAATATGCGTGTTTTGCACTATGGTGCTGGTTCT6ACAAAGGTGTGGCACCTGGTACAAC 

I ' I I » I I I ■ ■ , ■ t I ■ ■ I ^ 19780 

ATGGATTTATCGT6ATGTTACACGCAT6GAGTATTATAC6CACAAAACGTGATACCACGACCAAGACTGTTTCCACACCGTGGACCAT6TT6 

YLNSTTHC. VPHNM.RVLHYGAGSOKGVAPGTT 

Replicase lb 



TGTTTTAAAAC6TTGGCTACCACCTGATGCAATAATCATTGATAATGATATCAATGATTAT6TTAGT6ATGCAGATTTTAGCATTACA6GTG 

I I ■ I ■ I I ■ ■ I I I I I I I I I I ■ ■ ■ I 1 19872 

ACAAAATTTT6CAACCGATGGTGGACTACGTTATTAGTAACTATTACTATAGTTACTAATACAATCACTACGTCTAAAATCGTAAT6TCCAC 

VLKRWLP PDAI ! IDNOINDYVSDADFSITG 
— — — ^ Replicase lb 



ATTGTGCTACTGTTTACCTTGAAGATAAGTTTGACTTACTTATTTCTGATATGTATGATGGTAGAATTAAATTTTGTGATGGTGAAAACGTC 

1 . ■ I ' I I 1 I ^ ■ ■ I I . ■ I I I I I I 19964 

TAACACGATGACAAATGGAACTTCTATTCAAACTGAATGAATAAAGACTATACATACTACCATCTTAATTTAAAACACTACCACTTTTGCAG 

OCATVYLEOKFDLL ISDMYDGRI KFCDGENV 
— — ^ Replicase lb 



TCTAAAGATGGTTTTTTTACTTATCTTAATGGTGTTATTAGAGAAAAATTAGCTATTGGTGGTAGTGTTGCCATTAAGATTACAGAATATAG 

I ■ ■ ■ I I ■ I I I I ' I 1 I I I ' f 20056 

AGATTTCTACCAAAAAAATGAATA6AATTACCACAATAATCTCTTTTTAATCGATAACCACCATCACAAC6GTAATTCTAATGTCTTATATC 

SKOGFFTYLNGVIREKLAI GGSVAI K I TEYS 

' —Replicase lb 



TTGGAATAAGTATCTTTATGAATTAATACAAAGATTTGCTTTTTGGACTTTGTTCTGCACGTCTGTTAATACATCCTCTTCAGAAGCTTTTC 

1 I I I 1. ■ ' ' I ■ ■ ■ I ' ■ I ■ t I 20t48 

AACCTTATTCATAGAAATACTTAATTATGTTTCTAAAC6AAAAACCT6AAACAAGACGT6CAGACAATTATGTAG6AGAAGTCTTCGAAAA6 

WNKYLYELIQRFAFWTLFCTSVNTSSSEAF 
— Replicase 1b 



TTATTGGTATTAATTATTTAGGTGACTTTATTCAAG6TCCTTTTATAGCT6GTAACACTGTTCATGCTAATTATATATTTTGGCGTAATTCT 

■ I I ■ ■ ■ I I ■ I > ■ I ■ ■ ■ ■ I I I I I 20240 

AATAACCATAATTAATAAATCCACTGAAATAAGTTCCAGGAAAATATCGACCATT6TGACAAGTACGATTAATATATAAAACCGCATTAA6A 

L 1 G I NYLGDF I Q GPF I AGNTVHANY 1 F WRNS 

Replicase lb 



ACTATTAT6TCTTTGTCATACAATTCA6TTTTAGATTTAAGTAAGTTTGAATGTAAACATAAGGCCACTGTTGTTGTTACACTTAAAGATAG 

I ' ■ ■ ■ I I I I I ' ■ i ■ ' I ■ ' ' ' I I ■ ■ 20332 

TGATAATACAGAAACAGTATGTTAA6TCAAAATCTAAATTCATTCAAACTTACATTTGTATTCCGGT6ACAACAACAATGTGAATTTCTATC 

Tl MSLSYNSVLDLSKF ECKHKATVVV TLKD5 
Replicase lb : 



TGATGTAAATGATATGGTTTTGAGTTTGATTAAGAGTGGTAGGTTGTTGTTACGTAATAGTGGCCGTTTTGGTGGTTTTAGTAATCATTTAG 

— H 1 H- — ■ I I I I \ I I ■ ■ I 4— 20424 

ACTACATTTACTATACCAAAACTCAAACTAATTCTCACCATCCAACAACAATGCATTATCACCGGCAAAACCACCAAAATCATTAGTAAATC 

OVNDMVLSLIKSGR LLLRNSGRFGGFSNHL 
— ■ Replicase lb 
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TCTCAACTAAATGAAACTTTTCTTGATTTTGCTTATTTTGCCCCTGGTTTCTT6CTTTTCTACATGTAACA6TAAT6CTAGTATTTCTATGT 

.1 ' I ■ ■ ■ ■ I I I I I I ■ ■ ■ I ■ . I I ■ ■ I I ^ 20516 

AGAGTTGATTTACTTTGAAAAGAACTAAAACGAATAAAACGGGGACCAAAGAACGAAAAGATGTACATTGTCATTACGATCATAAAGATACA 

.MKLFLILL ILPLVSCFSTCNS. NASI SM 
I Spike 



V S T K . , 
— Replicaselb— ' 

TACAATTAGGTGTTCCTGATAACTCTTCAACTATTGTCACAGGTTTGTTGCCA6TCCATT(5GATTTGT.6CTAATCAGA6TACATCTAGTTAC 

■ ■ ■ J I I I ■ I I I ' ' I ■ ■ ' I I ■ 1 I I ■ 20608 

ATGTTAATCCACAAGGACTATTGAGAAGTTGATAACAGTGTCCAAACAACGGTCAGGTAACCTAAACACGATTA6TCTCAT6TAGATCAATG 

LQLGVPDNSSTIVTGLLPVHWICANQSTSSY 
Spike 



CCAGCCAACGGCTTTTTCTATATTGATGTTGGTAAACACCGTAGTGCCTTTGCACTCCATA6TGGTTATTATGATGCTAACCAGTATTATAT 

■ I I I I I I I I I 1 I I ' I I 20700 

GGTCGGTTGCCGAAAAAGATATAACTACAACCATTTGTGGCATCACGGAAACGTGAGGTATCACCAATAATACTACGATTGGTCATAATATA 

PANGFFY IQVGKHRSAFALHSGYYDANQYY I 
— ^ Spike 



TTATCTCACTAATAAAATACATTTAAATGCTCCTGTCACTCTGAAGATTTGTAAGTTTGGAAACACTTCTTTTGATTTTTTAAGTAATGTTT 

I I I I ■ ■ I I I I I ■ I . .. I I 20792 

AATAGAGTGATTATTTTAT6TAAATTTACGAGGACAGT6AGACTTCTAAACATTCAAACCTTTGTGAAGAAAACTAAAAAATTCATTACAAA 

YLTNK I HLNAPVTLK I CKFGNTSF OFLSNV 

— ^ __ — Spike 



CTACTTCTCATGATTGTATAGTTAATTTGTCATTCACAGAACAGTTAGGTGTGCCTTTGGGCATAACTATATCGGGTGAAACTGTACGTTTG 

I I I i I I I i ' ' I I 'I ' ■ I ■ ' I ' ' ' 20884 

GATGAAGA6TACTAACATATCAATTAAACAGTAAGTGTCTTGTCAATCCACACGGAAACCCGTATTGATATAGCCCACTTTGACATGCAAAC 

STSHDC IVNLSFTEQL6VPLGI TIS6ETVRL 
: Spike : = 



CATTTATATAATGCAACTCGTACTTTTTATGT6CCG6CC6CTTATAAACTTACTAAACTTAGT6TTAAATGTTACTTTA6TGAATCCTGT6T 

I I I ■ ' I I I I I i I 20976 

GTAAATATATTACGTTGA6CAT6AAAAATACACGGCCGGCGAATATTTGAATGATTTGAATCACAATTTACAATGAAATCACTTAG6ACACA 

HLYNATRTFYVPAAYKLTKLSVKCYFSESCV 
Spike 



TTTTAGTGTTGTCAATGCCACCATTACTGTTAATGTCACCACACTTAATG6CCGTATAGTTAACTACACTGTTTGTGATGATTGTAATGGTT 

■ I I I ■ i ' I ■ ■ ■ I ■ ■ ■ I I .... I ■ ... I ■ ■ ■ I . . I ■ I I . 21068 

AAAATCACAACAGTTACGGTGGTAATGACAATTACAGT66TGTGAATTACCGGCATATCAATTGATGT6ACAAACACTACTAACATTACCAA 

FSVVNATITVNVTTLNGRIVNYTVCODCNG 
« Spike ^ 



ATACTGATAACATATTTTCTGTTCAACAGGATGGCCGCATTCCTAATGGTTTCCCTTTTAATAATTGGTTTTTGTTAACTAATGGTTCCACA 

. I I ' I ►H \ I ■ ■ I I--H 1 1 1- — -I t I 21160 

TATGACTATTGTATAAAAGACAAGTTGTCCTACCGGCGTAAGGATTACCAAAGGGAAAATTATTAACCAAAAACAATTGATTACCAAGGTGT 

YTDNI FSVQODGRIPNGFPFNNWFLLTNGST 

Spike — 
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TTAGTGGACGGGGTCTCTAGACTTTATCAACCACTCCGTTTAA CTTGTTTATGGCCTGTACCTGGTCTTAAATCTTCAACTGGTTTTGTTTA 

' ' ■ ■ I ■ ' ' I ■ ' ■ ■ I ■ ■ ■ ■ I ■ I ■ I I I I I I ■ I I ■ II , , 1,1 1 , I 1 , , , I I I I ^ I I r I I I I _ _ I _ . 21252 

AATCACCTGCCCCAGAGATCT6AAATAGTTGGTGAGGCAAATT6AACAAATACCGGACATG6ACCAGAATTTA6AAGTTGACCAAAACAAAT 

L V D G V SRLYOPLRLTCLWPV PGLKSSTGFVY 
Spike — — ^ 

TTTTAATGCCACTGGTTCTGATGTTAATTGTA ACGGCTATCAACATAATTCTGTTGCTGATGTTATGCGTTACAATCTTAACCTCAGTGCTA 

' ' ' ' ' ' ' ' I I ' ■ ■ i I I I ■ ■ . 21314 

AAAATTACGGTGACCAAGACTACAATTAACATTGCCGATAGTTGTAJTAAGACAACGACTACAATACGCAATGTTAGAATTGGAGTCACGAT 

^ N A T GSDVNCNGYOHNS VAOVMRYNLNLSA 
• Spike 



ATTCTGTGGACAATCTTAAGAGT.GGTGTTAT AGTTTTTAAAACTTTACAGTACGATGTTTTGTTTTATTGTAGTAATTCTTCTTCAGGTGTT 

' ' ' ' ' ' ' ' ' i I ■ ... I I ■ I I I I , I . 21*136 

TAA6ACACCT6TTAGAATTCTCACCACAATATCAAAAATTTTGAAATGTCATGCTACAAAACAAAATAACATCATTAAGAA6AA6TCCACAA 

NSVONLKS GV I VFKTL OYOVLFYCSNSSS GV 
— ' Spike ■ • — 



CTTGACACCACAATACCTTTTGGCCCTTCCTCTCAACCTTATTACTGTTTTATAAACAGTACTATCAACACTACTCAT6TTAGCACTTTTGT 

' ' ' ■ ■ ■ i ' ' 1 I ■ ■ ■ I ■ : ■ I : i I . ■ f ■ ■ ■ I I ■ , , I I , . , , 21528 

GAACTGTGGTGTTATGGAAAACCGG6AA6GA6AGTTGGAATAATGACAAAATATTTGTCATGATAGTTGTGATGAGTACAATCGTGAAAACA 

L 0 TTIPFGPSSOPYYCFINSTINTTHVSTFV 
— ^ — Spike • 

GG6TATTTTACCACCCACTGTGCGTGAAATTGTTGTTGCTAGAACTGGTCAGTTTTATATTAATGGTTTTAAGTATTTCGATTTGGGTTTCA 

' ' ' ' ' ' I I ■ ■ r i I I I , I I , I , 7 . I I 21620 

CCCATAAAATGGTGGGTGACACGCACTTTAACAACAACGATCTTGACCAGTCAAAATATAATTACCAAAATTCATAAAGCTAAACCCAAAGT 

G I LPPTVRE I VVART GQFY I NGF KYFDLGF 
— Spike — 



TAGAAG CTGTCAATTTTAATGTCACGACTGCTAGTGCCACAGATTTTTGGACGGTTGCATTTGCTACTTTTGTTGATGTTTTGGTTAATGTT 

' ' ' ' ■ ' 1 ' I ■ I I - I ■ ■ t I 21712 

ATCTTCGACAGTTAAAATTACA6TGCTGACGATCACGGTGTCTAAAAACCTGCCAACGTAAACGATGAAAACAACTACAAAACCAATTACAA 

lEA, V NFNVTTASATOFWTVAFATFVDVLVNV 
— Spike — • ___ 



AGTGCAACTAACATTCAAAACTTACTTTATTGCGATTCTCCATTTGAAAAGTTGCAGTGTGAGCACTTGCAGTTTGGATTGCAAGAT6GTTT 

— H ^ 1 1 I 1 I ■ ■ ' 1 I I i ■ . ■ > 21804 

TCACGTTGATTGTAAGTTTTGAATGAAATAACGCTAAGAG6TAAACTTTTCAACGTCACACTCGTGAACGTCAAACCTAACGTTCTACCAAA 

SAT NIQNLLYCOSPFEK LQCEHLQFGLQDGF 
Spike : . 

TTATTCTGCAAATTTTCTTGATG ATAATGTTTTGCCTGAGACTTATGTTGCACTCCCCATTTATTATCAACATACGGACATAAATTTTACTG 

^ ''''''*''' ' ' ' I 1 I I I ■ ■ I ■ I I ■ ■ I .1 I , J , I I , I .1 21896 

AATAAGACGTTTAAAAGAACTACTATTACAAAACGGACTCTGAATACAACGTGAGGGGTAAATAATAGTTGTAT6CCTGTATTTAAAATGAC 

Y S ANFLDDNVLPETYVALP IYYOHTDINFT 
Spike — 



CAACTGCATCTTTTGGTGGTTC TTGTTATGTTTGTAAACCACGCCAGGTTAATATATCTCTTAATGGTAACACTTCAGTGTGTGTTAGAACA 

' ' ■ I ■ I I I ■ I I I ■ I I i I • .1 I I I , 21988 

GTTGACGTAGAAAACCACCAAGAACAATACAAACATTTGGT6CG6TCCAATTATATAGAGAATTACCATTGTGAAGTCACACACAATCTT6T 

A T A 5 F G 6SCYVCKPR0VNISLN6NTSVCVRT 
' — ' Spike — — — — — 
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TCTCATTTTTCAATTAGGTATATTTATAACCGCGTTAAGAGTGGTTCACCAGGTGACTCTTCATGGCATATTTATTTAAAGAGTG6CACTT6 

' i ■ ■ ■ I I I I ■ ■ I . ■ . . I I . . I I ■ ■ . . I i I , I , , I I 22080 

AGAGTAAAAAGTTAATCCATATAAATATTGGCGCAATTCTCACCAAGTGGTCCACTGAGAAGTACCGTATAAATAAATTTCTCACCGTGAAC 

SHFS I RY.I YNRVKSGSPGD'SSWH I Y LKS6TC 
Spike ; 



TCCATTTTCTTTTTCTAAGTTAAATAATTTTCAAAAGTTTAAGACTATTTGTTTCTCAACCGTC6AAGTGCCTGGTAGTT6TAATTTTCCAC 

I ■ ■ ■ ■ I ■ . ■ , I I.I.I I I I I I ■ . 1 ■ . , , I I , , 22172 

AGGTAAAAGAAAAAGATTCAATTTATTAAAAGTTTTCAAATTCTGATAAACAAAGAGTTGGCAGCTTCACGGACCATCAACATTAAAAGGTG 

PFSFSKLNNFOKFKT ICF6TVEVPGSCNFP 
Spike 



TTGAAGCCACCTGGCATTACACTTCTTATACTATT6TTGGTGCTTT6TATGTTACTTGGTCTGAA66TAATTCCATTACTGGTGTACCTTAT 

I i I ' ■ ■ ■ I I I I ■ ■ i I 22264 

AACTTCGGTGGACCGTAATGTGAAGAATATGATAACAACCACGAAACATACAATGAACCAGACTTCCATTAAG6TAATGACCACATGGAATA 

LEATWHYTSY T IVGALYVTWSEGNSI TGVPY 
Spike '• 



CCTGTCTCTGGTATTCGTGAGTTTAGTAATTTAGTTTTAAATAATTGTACCAAATATAATATTTATGATTATGTTGGTACTGGAATTATACG 

" I ■ ' 1 1 1 1 t I ■ ■ I ■ ■ I 1 ' ■ ' ' I I ' ' 'I I 22356 

GGACAGAGACCATAAGCACTCAAATCATTAAATCAAAATTTATTAACATG6TTTATATTATAAATACTAATACAACCATGACCTTAATATGC 

PVSGIREFSNLVLNNCTKYNIYOYVGTGI IR 
Spike 



TTCTTCAAACCAGTCACTTGCTGGTG6TATTACATATGTTTCTAACTCTGGTAATTTACTTGGTTTTAAAAATGTTTCCACTGGTAACATTT 

■ ■ ' I I lil t -' I . ■ I ■ I . . ■ I I r , , I I I — , 1 ' ' i I ■ I I I ■ 22148 

aagaagtttggtcagtgaacgaccaccataatgtatacaaagattgagaccattaaatgaaccaaaatttttacaa^ggtgaccattgtaaa 

ssnqslaggityvsnsgnlLgfknvstgni 
Spike ' : : 



ttattgtgacaccatgtaaccaaccagatcAagtagctgtttatcaacaaagcattattggtgccatgaccgctgttaatgagtctagatat 

■ I ' ■ ■ ■ i I ■ ■ ■ ' I ■ . ' ' I 1 ■ t >\ I •' . ■ I I t I 22540 

aataacactgtggtacattggttggtctagttcatcgacaaatagttgtttcgtaataaccacggtactggcgacaattactcagatctata 

fivtpcnqpdo'vavyqqsi igamtavnesry 
Spike 



ggcttgcaaaacttactacagttacctaacttttattatgttagtaatggtggtaacaattgcactacggctgttatgatttattctaattt 

' ' I ' ' i I I I I I I .. I I I I ... I I . ■ 22632 

CC6AACGTTTTGAATGATGTCAATGGATTGAAAATAATACAATCATTACCACCATTGTTAACGT6AT6CCGACAATACTAAATAAGATTAAA 

GLQNLLQLPNFYYVSNGGNNCTTAVhl YSNF 
— Spike 



TGGTATTTGTGCTGATGGTTCTTTAATTCCTGTTCGTCC6CGTAATTCTA6T6ATAATGGTATTTCAGCCATAATCACTGCTAATTTATCCA 

1 ■ ■ ■ i I I ■ ... I I ■ I I 1 I I I I I 22724 

ACCATAAACAC6ACTACCAAGAAATTAA6GACAAGCAGGCGCATTAAGATCACTATTACCATAAAGTCGGTATTAGTGACGATTAAATAGGT 

GICAOGSLIPVRPRNSSONGISAI ITANLS 
Spike 



TTCCCTCTAACTGGACTACTTCAGTTCAAGTTGAGTACCTCCAAATTACTAGTACTCCAATAGTTGTTGATTGTGCTACTTATGTGTGTAAT 

I ' H-^ 1 I I I 1 > ■ ■ ' I I ' ' ' < I I 22816 

AAGGGAGATTGACCTGATGAAGTCAAGTTCAACTCATGGAGGTTTAATGATCATGAGGTTATCAACAACTAACACGATGAATACACACATTA 

IPSNWTTSVQVEYLQITSTPIVVDCATYVCN 
Spike 
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6GTAACCCTCGTT6TAAGAATCTACTTAAGCAGTATACTTCTGCTTGTAAAACTATTGAAGATGCCTTACGACTTAGTGCTCATTTGGAAAC 

■ ■ ■ I I I I ' I I ■ I \ I I I ■ . I I i II 22908 

CCATT66GAGCAACATTCTTAGATGAATTC6TCATATGAAGACGAACATTTTGATAACTTCTACGGAATGCTGAATCACGAGTAAACCTTTG 

6NPRCKNLLK0YTSACKT |. EDALRLSAHLET 
Spike ■ 



TAATGATGTTAGTAGTATGCTAACTTTCGATAGCAATGCTTTTAGTTTGGCTAATGTTACTAGTTTTGGAGATTATAACCTTTCTAGTGTTT 

■I 1 I I I I ) . I . I I . ■ I , i 1 ' ■ ■ ■ I 1- 23000 

ATTACTACAATCATCATACGATT6AAAGCTATCGTTACGAAAATCAAACCGATTACAATGATCAAAACCTCTAATATTGGAAAGATCACAAA 

NDVSSMLTFD SNAFSLANVTSFGDYNLSSV 
Spike ' : 



TACCTCAGAGAAACATTCATTCAA6CCGTATAGCAGGACGTA6TGCTTTGGAAGATTT6TT6TTTA6CAAAGTTGTTACATCTGGTTTGGGT 

I ■ . , ■ I , ■ \ I I I I ■ ■ ■ I I I I — 23092 

ATG6AGTCTCTTTGTAA6TAAGTTCGGCATATCGTCCT6CATCACGAAACCTTCTAAACAACAAATCGTTTCAACAATGTAGACCAAACCCA 

LPQRNIHSSRIAGRSALEOLLFSKVVTSGLG 
Spike 



ACTGTTGATGTTGACTATAAGTCTTGTACTAAAGGTCTTTCTATTGCTGACCTTGCTT6TGCTCAGTACTACAATGGCATAATGGTTTTGCC 

I 1 I ' ■ ■ I I I ' ■ ■ I I I I ■ I I 23184 

TGACAACTACAACTGATATTCAGAACATGATTTCCAGAAAGATAAC6ACTGGAACGAACACGAGTCATGATGTTACC6TATTACCAAAACGG 

TVD VOYKSCTKGL SI.ADLACAQYYNG I MVLP 



AGGTGTTGCTGATGCTGAACGTATGGCCATGTACACAG6TTCTCTTATAGGTGGCATGGTGCTCGGAGGTCTTACATCAGCAGCCGCCATAC 

I I ■ ■ ■ I- . ■ ■ I I i I ■ ■ ' -I ■ I r J ■ ■ ■ H I ■ ■ I : 23276 

TCCACAACGACTACGACTTGCATACCGGTACATGTGTCCAAGAGAATATCCACCGTACCACGAGCCTCCAGAATGTAGTCGTCGGCGGTATG 

6VA0AERMAMYTGSLIG6MVLGGLTSAAAI 
. Spike \ 



CTTTTTCTTTG6CACTGCAAGCACGACTTAACTATGTTGCTTTACAAACTGATGTGCTTCAAGAAAATCAGAAAATTTTGGCTGCATCATTT 

« ■ ■ I 1 ' ■ I ■ ■ ■ ■ I I ■ ■ ■ I > I I * I ■ I i I ■ ■ I 23368 

GAAAAAGAAACCGTGACGTTC6TGCTGAATTGATACAACGAAATGTTTGACTACACGAAGTTCTTTTAGTCTTTTAAAACCGACGTAGTAAA 

PF SLALQARLNYVALQTDV.LOENQK I LAASF 
: Spike — — — — i^^— — 



AATAAGGCTATTAATAATATTGTTGCTTCTTTTAGTAGCGTTAATGATGCTATTACACATACTGCAGAGGCTATACATACTGTTACTATTGC 

■ i I ■ I I 1 I I I I ■ . I ■ ■ ■ ■ I ■ : i ■ I i ■ . I 1 23460 

TTATTCCGATAATTATTATAACAACGAAGAAAATCATCGCAATTACTACGATAATGTGTATGACGTCTCCGATATGTATGACAATGATAACG 

NKAINNIVASFSSVNDAITHTAEAIHTVTIA 



ACTTAATAAGATTCAGGATGTTGTTAATCAACA6GGTA6TGCTCTTAACCATCTCACTTCACAATT6AGACATAATTTTCAGGCCATTTCTA 

..-~H r ■ " i ■ ■ ■ ■ I i ' I I ' ' I I » 1— 23552 

TGAATTATTCTAAGTCCTACAACAATTAGTTGTCCCATCACGAGAATTGGTAGAGTGAA6T6TTAACTCT6TATTAAAAGTCC6GTAAA6AT 

LNK I O DVVNGQGSA LNHLTSQLRHNFQA I S 



ATTCAATTCATGCTATTTATGACCG6CTTGATTCAATTCAAGCCGATCAACAAGTTGACAGATTAATTACTGGACGGCTT6CA6CTTT6AAT 

- I ' I I I ■ : I 'I 1 ■ ■ I ■ 1 i 1 ■ ■ I ■ ■ ■ ' 23644 

TAAGTTAAGTACGATAAATACTGGCCGAACTAAGTTAAGTTCGGCTAGTTGTTCAACTGTCTAATTAATGACCTdCCGAACGTCGAAACTTA 

NS I HA I YDRLDS 1 QADQOVORL I TGRLAALN 
Spike 
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GCATTTGTTTCCCAAGTTTTGAATAAATATACTGAAGTTCGTGGTTCqAGACGCTTAGCACAGCAGAAGATTAATGAATGTGTCAAGTCACA 

' ' ' ' ' ' ' 'I I ' ■ ' ' ' I ' ■ I ' ■ ■ ■ I I i ' ' \ I I 1 23736 

CGTAAACAAAGGGTTCAAAACTTATTTATATGACTTCAAGCACCAAGGTCTGCGAATCGTGTCGTCTTCTAATTACTTACACAGTTCAGTGT 

AFVSQVLNKYTEVRGSRRLAQQK I NECV. KSQ 
Spike — 



ATCTAATAGATATGGTTTTTGTGGCAATGGCACTCACATCTTTTCAATCGTCAACTCAGCTCCAGATGGTTTGCTTTTTCTTCATACTGTTT 

■ I I I I I \ \ I I I I I . I 23828 

TAGATTATCTATACCAAAAACACCGTTACCGTGAGTGTAGAAAAGTTAGCAGTTGA6TCGAGGTCTACCAAACGAAAAAGAAGTATGACAAA 

SNRY6FC6NGTHIFSIVNSAPDGLLFLHTV 
Spike ' 



TGCTGCCAACTGATTACAAGAATGTAAAGGC6TGGTCTGGTATCTGTGTTGATGGCATTTAT6GCTATGTTCTGCGTCAACCTAACTT6GTT 

■I 1 I . ■ ■ I i ^ 1 ■ ■ 'I I I I I ■ ■ I ' ■ ■ ■ 1 23920 

ACGACGGTTGACTAATGTTCTTACATTTCCGCACCAGACCATAGACACAACTACCGTAAATACCGATACAAGACGCAGTTGGATTGAACCAA 

LLPTOYKNVKAWSGi CVDG I Y6YVLRQPNLV 

• Spike ' 



CTTTATTCTGATAATGGT6TCTTTCGTGTAACTTCCAGGGTCATGTTTCAACCTCGTTTACCTGTTTTGTCTGATTTTGTGCAAATATATAA 

I I I I I I I I I ■ ■ 24012 

GAAATAAGACTATTACCACAGAAAGCACATTGAAGGTCCCAGTACAAAGTTGGAGCAAATGGACAAAACAGACTAAAACACGTTTATATATT 

LY SDNGVFRVTSRVMFQPRLPVLSDFVQ I YN 
Spike 



TTGTAATGTTACTTTTGTTAACATATCTCGTGTCGAGTTACATACTGTCATACCTGACTACGTTGATGTTAATAAAACATTACAAGAGTTTG 

I I I I " 'I I 1 ^' ' ■ ' I I » 1 ■ ■ ■ ■ 2410« 

AACATTACAATGAAAACAATT6TATAGAGCACAGCTCAATGTATGACAGTATGGACTGATGCAACTACAATTATTTTGTAATGTTCTCAAAC 

C NVTFVN I SRVELHTV IPOYVO VNKTLQEF 
Spike T 



CACAAAACTTACCAAAGTATGTTAAGGCTAATTTTGACTTGACTCCTTTTAATTTAACATATCTTAATTTGAGTTCTGAGTTGAAGCAACTC 

I ■ I ' I ' ■ ' I t ' I . I . . I . ■ . I . I I 1 ■ ■ , ■ I ■ , ■ , I ■ ■ , i , 21196 

GTGTTTTGAATG6TTTCATACAATTCGGATTAAAACTGAACTGAGGAAAATTAAATTGTATAGAATTAAACTCAAGACTCAACTTCGTTGAG 

AQNLPKYV KP NFO LTPFNLTYLNLSSELKQL 
Spike 



GAAGCTAAAACTGCTAGTCTTTTCCAAACTACTGTTGAATTACAAGGTCTTATTGATCAGATTAACAGTACATATGTTGATTTGAAGTTGCT 

■ ■ ■ I I I I I 1 I i I 24288 

CTTC6ATTTTGACGATCAGAAAAGGTTTGATGACAACTTAATGTTCCAGAATAACTAGTCTAATTGTCATGTATACAACTAAACTTCAAC6A 

EAKTASLFQTTVELQGL IDQINSTYVDLKLL 
Spike 



TAATA6GTTTGAAAATTATATCAAAT66CCTTGGTGGGTTT6GCTCATTATTTCTGTTGTTTTTGTTGTATTGTTGAGTCTTCTTGT6TTTT 

■I I I I I I ' I ' I I I I 24380 

ATTATCCAAACTTTTAATATA6TTTACC66AACCACCCAAACCGA6TAATAAA6ACAACAAAAACAACATAACAACTCAGAA6AACACAAAA 

NRFENYIKW 'PWWVWLI ISVVFVVLLSLLVF 
■ Spike —————— —i^^—— 



6TTGTCTTTCTACA6GTT6TTGTGGTT6TTGCAATTGTTTAACTTCATCAATGCGAGGCTGTT6T6ATTGTGGTTCAACTAAACTTCCTTAT 

t I I ' ■ ' ■ I ■ ■ ■ ■ I I I ■ I 1 ■ I I ■ 24472 

CAACAGAAAGATGTCCAACAACACCAACAACGTTAACAAATTGAAGTAGTTACGCTCC6ACAACACTAACACCAAGTTGATTTGAAGGAATA 

CCLSTGCCGCCNCLTSSriRGCCDCGSTKLPY 

— Spike = 
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TATGAATTTGAAAAGGTCCACGTTCAATAATGCCTTTCGGTGGCCTATTTCAACTTACTCTTGAAAGTACTATTAATAAGAGTGTGGCTAAT 

. I I i I I ■ ■ ■ » r I ■ ■ . ■ ■ I I ■ ■ ■ i . I . . I 1 ■ ■ ■ 24564 

ATACTTAAACTTTTCCAGGTGCAAGTTATTAC6GAAAGCCACCGGATAAAGTTGAAT6AGAACTTTCATGATAATTATTCTCACACCGATTA 



YEFEKVHVQ 
— Spike ' 



.hPFGGLFQLTLEST INKSVAN 
I ORF 4ab 



CTCAAATTACCACCTCATGATGTTACTGTCTTGCGTGACAATCTTAAACCTGTTACTACACTTAGTACTATCACTGCTTATTTGTTAGTTAG 

I.I. I I ■ ' -I I . ' ■ ■ I I 1 ■ ■ ' I I I I I ' ■ ■ I > ■ 24656 

GA6TTTAATGGTGGAGTACTACAATGACAGAAC6CACTGTTAGAATTTG6ACAATGATGTGAATCATGATA6TGACGAATAAACAATCAATC 

LKLPPHDVTVLRDNLKPVTTLST I TAYLLVS 
ORF 4ab— 



TTT6TTT6TCACTTATTTTGCTTTATTCAAACCTCTTACTGCTAGAGGTC6CGTTGCTTGTTTTGTTTTAAAACTATTGACACTATCTGTCT 

■ ■ ■ I I ' I ' . 1 > I ■ : : : I | I I I I 24748 

AAACAAACAGTGAATAAAACGAAATAAGTTTGGAGAATGACGATCTCCAGCGCAACGAACAAAACAAAATTTTGATAACTGTGATAGACAGA 

LFVTYFALFKPLTARGRVACF VLKLLTLSV 
ORF 4ab 



ATGTGCCTTTATTGGTTCTTTTTGGTATGTATCTT6ACAGTTTTATAATTTTTTTTCTACGCTGTTGTTTCGATTCATACATGTT6GCTATT 

I I 1 . I I I I 1 1 1 • H I ' H- 24840 

TACACGGAAATAACCAA6AAAAACCATACATAGAACTGTCAAAATATTAAAAAAAAGATGCGACAACAAAGCTAAGTATGTACAACC6ATAA 

YVPLLVLFGMYLDSFI IFFLRCCFDSYMLAI 
— ORF 4ab 



ATGCCTATCTCTAATAAAAATTTTTCATTTGTTTTGTTCAATGTTACTAAACTATGCTTCGTTTCA6GCAAGTGTT6GTATCTTGAACAATC 

I II I I I I I . . . ■ ■ I ■ ■ ■ I i I 1 1 ■ ■ ■ i ■ ■ 24932 

TACGGATAGAGATTATTTTTAAAAAGTAAACAAAACAAGTTACAATGATTTGATACGAAGCAAAGTCCGTTCACAACCATAGAACTTGTTA6 

MPISNKNFSFVLFNVTKLCFVSGKCWYLEQS 
■ ^ORF 4ab 



ATTTTATGAAAATCGTTTTGCTGCTATTTATGGTGGTGACCACTATGTCGTTTTAGGT6GT6AAACTATTACTTTTGTTTCTTTTGATGACC 

■ I I 1 I ... I I ■ . 1 ■ I i ■ I ■ ' ' 1 - I ■ I I ■ ■ ■ ■ I ■ . ■ . I f , . I ■ ■ ■ ■ 25024 

TAAAATACTTTTAGCAAAACGACGATAAATACCACCACTGGTGATACAGCAAAATCCACCACTTTGATAATGAAAACAAAGAAAACTACTGG 

FYENRFAAIY6G0HYVVLGGET I TFVSFDD 
— ORF 4ab 



TTTATGTTGCTATTAGAGGTTCTTGTGAAAAGAACCTACAACTTATGCGTAA6GTTGACTTGTATAATGGTGCTGTCATTTACATTTTTGCC 

H 1 , 1 1 1 ^H-H 1 H 1 1 i 1 I I 25116 

AAATACAACGATAATCTCCAAGAACACTTTTCTTGGATGTTGAATACGCATTCCAACTGAACATATTACCACGACAGTAAATGTAAAAACGG 

LYVAIRGSCEKNLQLMRKVOLYNGAVIY IFA 
— — ^ ORF 4ab- 



GAAGAGCCTGTTGTTGGTATAGTTTACTCCTCTCAACTATACGAAGATGTTCCTTCGATTAATTGATGACAATGGCATTGTCCTCAATTCTA 

... I 1 I I I I I ■ ■ ■ .I I I I I ■ I 25208 

CTTCTCGGACAACAACCATATCAAATGAGGAGAGTT6ATATGCTTCTACAAGGAAGCTAATTAACTACTGTTACCGTAACAGGAGTTAAGAT 

.NFLRLIDDNGIVLNS 
I E 



EEPVVGIVYSSOLYEOVPSIN 
ORF 4ab 
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TTTTATG6CTCCTTGTTATGATATTTTTCTTTGTGTTGGCAATGACCTTTATTAAACTGATTCAATTGTGTTTTACTTGTCATTATTTTTTT 

■ I ■ ' ' ■ ■ ■ I i I ' i ' I I I I . > > i I i 25300 

AAAATACCGAGGAACAATACTATAAAAAGAAACACAACCGTTACT6GAAATAATTTGACTAAGTTAACACAAAATGAACAGTAATAAAAAAA 

ILWLLVMIFFFVLAMTFIKLIQLCFTCHYFF 
. : E . 



AGT.AGGACATTATATCAACCAGTTTATAAAATTTTTCTTGCTTACCAAGATTATAT6CAAATAGCACCTGTTCCAGCTGAA6TACTAAATGT 

I I 1 ' ' ' I ■ ■ ■ ■ I . . ■ I 1 ' I 1 I ■ ' 25392 

TCATCCTGTAATATAGTTGGTCAAATATTTTAAAAA6AACGAATGGTTCTAATATACGTTTATCGTGGACAAGGTCGACTTCATGATTTACA 

SRTLYQPVYKIFLAYODYMQIAPVPAEVLNV 
; E 



CTAAACTAAACGATGTCTAATAGTAGTGTGCCTCTTTCAGAGGTTTAT6TCCATTTAC6TAACT6GAACTTTAGTTGGAATTTAATTCTAAC 

I .... I ■ ... I I I I I I ■ I I ■ ■ ■ ■ I 2548'* 

GATTTGAtTTGCTACAGATTATCATCACACGGAGAAAGTCTCCAAATACAGGTAAATGCATTGACCTTGAAATCAACCTTAAATTAAGATTG 

.MSNSSVPL SE V.VVHLRNW NFSWNL I LT 
-eJ I 1^ : : 



AGTTTTTATA6TTGTGTTGCAGTATG66CATTATAAGTATAGCAGACTTCTTTATGGTTTAAA6ATGTCT6TTTTAT6GTGTTTATGGCCAC 

■ ■ ' I 1 ' I ■ ■ ■ ■ ■ I I : : . I 1 ■ ' I I I ■ ■ ■ I 1 25576 

TCAAAAATATCAACACAACGTCATACCCGTAATATTCATATCGTCT6AAGAAATACCAAATTTCTACAGACAAAATACCACAAATACC66T6 

VFIVVLOYGH YK YSR LLYGLKMSV LWCLWP 
M 



TTGTTCTAGCTTTGTCTAtTTTTGACTGTTTTGTCAATTTTAATGTGGACTGGGTCTTTTTTGGTTTTAGTATTCTTATGTCTATTATTACA 

■ ■ ■ I I ■ ■ ■ { I I ■ ■ ■ ■ I ■ ■ ■ I ' \ .1 1 I i I ■ ■ I ■ ■ ■ ■ I ■ ■ ■ I I ■ ■ ■ 25668 

AACAAGATCGAAACAGATAAAAACTGACAAAACA6TTAAAATTACACCT6ACCCAGAAAAAACCAAAATCATAAGAATACAGATAATAATGT 

LVLALSIFDCFVNFNVDWVFFG FSILMSI IT 

: M 



CTTTGTTTAT66GTTATGTATTTTGTTAATAGTTTCAGACTTTGGCGCCGTGTTAAAACTTTTTGGGCTTTTAATCCTGAAACTAATGCAAT 

■I 1 ■ ■ 1 ' I I ■ I I ! ' I I I I 25760 

6AAACAAATACCCAATACATAAAACAATTATCAAAGTCTGAAACCGC6GCACAATTTT6AAAAACCC6AAAATTAG6ACTTTGATTACGTTA 

LCLWVMYFVNSFRLWRRVKTFWAF NPETNA I 
M 



CATCTCTCTCCAGGTTTATGGACATAATTATTACTTACCGGTGATGGCTGCACCTACAGGTGTTACATTAACACTTCTTAGTGGTGTACTTC 

1 I i I I 1 h*— — I — ■ ' ' I I i ' ■ 25852 

GTAGAGAGAGGTCCAAATACCTGTATTAATAATGAATGGCCACTACCGACGTGGATGTCCACAATGTAATTGTGAAGAATCACCACATGAAG 

ISLOVYGHNYYLPVMAAPTGVTLTLLSGVL 
M 



TTGTTGATGGCCATAAGATT6CTACTCGTGTTCAAGTGGGTCAGTTGCCTAAATAT6TAATAGTTGCTACACCTAGTACCACAATT6TTTGT 

I I I I ' ■ ■ ■ I I I I ■ ■ ■ ■ I ■ . ■ ■ I ■ I , 25944 

AACAACTACCGGTATTCTAACGATGAGCACAAGTTCACCCAGTCAACGGATTTATACATTATCAACGATGTGGATCATGGTGTTAACAAACA 

LVDGHKIATRV0V60LPKYVIVATPSTTIVC 
M 



GACCGT6TTG6TCGCTCTGTTAAT6AAACAAGCCAGACTGGTT66GCATTCTACGTCCGTGCTAAACATGGT6ATTTTTCTGGTGTTGCCTC 

I ' ■ I ■ ■ I I ■ ' t ■ I ■ ■ I ■ ■ ■ I ■ ■ ■ I i ■ I I I I I 1 h- 26036 

CTGGCACAACCAGC6AGACAATTACTTTGTTCGGTCTGACCAACCCGTAAGATGCAGGCACGATTTGTACCACTAAAAAGACCACAACGGA6 

0RVGRSVNETSQTGWAFYVRAKHGDFS6VAS 
_ M 
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TCAGGAGGGTGTTTTGTCAGAAAGAGAGAAGTTGCTTCATTTAATCTAAACTAAACAAAATGGCTAGTGT^^^ 

AGTCCTCCCACAAAACAGTCTTTCTCfCTTCAACGAAGTAAATTAGATTTGATTTGTTTTACCGATCACATTTAACCCGGCTACTGTCTCGA ^^'^^ 
0 E G V L S E R ^^ E K L L H L I ! . , M. A s V N W A 0 0 R A 

GCTAGGAAGAAATTTCCTCCTCCTTCATTTTACATGCCTCTTTTGGTTAGTTCTGATAAGGCACCATATAGGGTCATTCCCA^ 
CGATCCTTCTTTAAAGGAGGAGGAAGTAAAATGTACGGAGAAAACCAATCAAGACTATTCCGTGGTATATCCCAGTAAGGGTCCTTAGAACA 
^"K'^'^PPPSFYMPLL VSSDKAPYRVIPRNLV 

CCCTATTGGTAAGGGTAATAAAGATGA GCAGATTGGTTATTGGAATGTTCAAGAGCGTTGGCGTATGCGCAGGGGGCAACGTGTTGATTTGC 
' ' ' ' ' ' I i I I I ■ I I I ... I I , . 26315 

GGGATAACCATTCCCATTATTTCTACTCGTCTAACCAATAACCTTACAAGTTCTCGCAACCGCATACGCGTCCCCCGTTGCACAACTAAACG 

PIGKGNKDEOiGYW N VQERWRMRRGORVOL 
^ ' ^ 



CTCCTAAAGTTCATTTTTATTACCTA GGTACTGGACCTCATAAGGACCTTAAATTCAGACAACGTTCTGATGGTGTTGTTTGGGTTGCTAAG 
' ' ' ' ' ' ' ' ■ ■ ■ I ■ ■ . - ■ ■ I I I 26404 

GAGGATTTCAAGTAAAAATAATGGATCCATGACCTGGAGTATTCCTGGAATTTAAGTCTGTTGCAAGACTACCACAACAAACCCAACGATTC 
PPKVHFYYLGTGPHK D LKFRQRSDGVVWVAK 



GAAGGTGCTAAAACT GTTAATACCAGTCTTGGTAATCGCAAACGTAATCAGAAACCTTTGGAACCAAAGTTCTCTATTGCTTTGCCTCCAGA 
CTTCCACGATTTTGACAATTATGGTCAGAACCATTAGCGTTTGCATTAGTCTTTGGAAACCTT66TTTCAAGAGATAACGAAACGGAGGTCT 
EGAKTVNTSLGNRKR NQKPLE.PKFSIALPPE 

GCTCTCTGTTGTTGAGTTTGAGGATCGCTCTAATAACTCATCTCGTGCTAGCAGTCGTT^ 

CGAGAGACAACAACTCAAACTCCTAGC6A6ATTATTGAGTAGAGCACGATC6TCAGCAA6AAGTTGAGCATTGTTGAGTGCTCTGAGAAGA6 ^^^^^ 
LSVVEFEDR5NNSSR ASSRSSTRNNSRDSS 



GTAGTACTTCAAGACAA CAGTCTCGCACTCGTTCTGATTCTAACCAGTCTTCTTCAGATCTTGTTGCTGCTGTTACTTTGGCTTTAAAGAAC 

' * ' i I ) . , , , I J ^ ^ ^ 26680 

CATCATGAAGTTCTGTTGTCAGAGCGTGAGCAAGACTAAGATTGGTCAGAA6AAGTCTAGAACAACGACGACAATGAAACCGAAATTTCTTG 

^STSROQSRTRSDSN O SSSDLVAAVTLALKN 

^ ' ■ ■ ■ I 



TTAGGTTTTGATAACCAGTCGAAGT CACCTAGTTCTTCTGGTACTTCCACTCCTAAGAAACCTAATAAGCCTCTTTCTCAACCCAGGGCTGA 

' ' ' ' I * ' ■ ' I I r - i I I • 26772 

AATCCAAAACTATT66TCAGCTTCAGTGGATCAAGAA6ACCATGAA6GTGAGGATTCTTTGGATTATTCGGAGAAAGA6TT6GGTCCC6ACT 
LGFDNQSKSPSS SG TSTPKKPNKPLSQPR. AO 

TAAGCCTTCTCAGTTGAAGAAACCT CGTTGGAAGCGTGTTCCTACCAGAGAGGAAAATGTTATTCAGTGCTTTGGTCCTCGTGATTTTAATC 

* ' I I I 1 I ■ ■ I I , < , I I I , , . I . , , I .... I . I ■ . . , I , I 26864 

ATTCGGAAGAGTCAACTTCTTTGGAGCAACCTTCGCACAAGGATGGtCTCTCCTTTTACAATAAGTCACGAAACCAGGAGCACTAAAATTAG 

'^ PSQLKKPRWKRVPT REENVI0CF6PR0FM 

N ^ 
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ACAATATGGGGGATTCAGATCTTGTTCAGAATGGTGTTGAT.GCCAAGGGTTTTCCACAGCTTGCTGAATTGATTCCTAATCAGGCT6CGTTA 

I I ■ ■ ■ ■ I ■ I I I I I ■ , I I t ■ I . I I ■ ■ I I I I I - , I 1 , , I ■ 26956 

TGTTATACCCCCTAAGTCTA6AACAAGTCTTACCACAACTACGGTTCCCAAAAG6T6TCGAACGACTTAACTAAGGATTAGTCCGACGCAAT 

HNMGOSDLVQNGVDAKGFPQLAEL I PNQAAL 
N ■• 



TTCTTTGATAGTGAGGTTAGCACTGAT6AA6TGGGTGATAATGTTCAGATTACCTACACCTACAAAATGCTTGTAGCTAAG6ATAATAAGAA 

■ ■ ■ I I . . ■ ■ i ■ , I I I I I ■ ■ ■ I I I I eyo^is 

AAGAAACTATCACTCCAATC6T6ACTACTTCACCCACTATTACAAGTCTAAT6GAT6TGGATGTTTTACGAACATCGATTCCTATTATTCTT 

FFDSEVSTDEVGDNVQITYTYKMLVAKDNKN 
— N : 



CCTTCCTAAGTTCATTGAGCAGATTAGTGCTTTTACTAAACCCAGTTCTATCAAAGAAATGCAGTCACAATCATCTCATGTTGCTCAGAACA 

■ i 1 1 ■ ' I ■ ■ I I • \ • ■«''''! ' ' i ' ■ ■ I I I 27140 

GGAAG6ATTCAAGTAACTCGTCTAATCAC6AAAATGATTTGGGTCAAGATAGTTTCTTTACGTCAGTGTTAGTAGAGTACAACGAGTCTTGT 

LPKF I EQ l.SAFTKPS'S IKEMOSQSSHVAQN 
N 



CAGTACTTAATGCTTCTATTCCAGAATCTAAACCATTGGCTGATGATGATTCAGCCATTATAGAAATTGTCAACGAGGTTTTGCATTAAATT 

: 1 1 I I . , ■ ■ I I , , . I I I . I I I ■ ■ I . ■ ■ . I ■ ■ . . I i ■ . 27232 

GTCATGAATTACGAAGATAAGGTCTTAGATTTGGTAACCGACTACTAtTAAGTCGGTAATATCTTTAACAGTTGCTCCAAAACGTAATTTAA 

TVLNASIPESKPLAOOOSAI lEIVNEVLH. I3'" 
N • ' 

GTTTTGTAATTCCAGTTGAATGTTTATTATTATTA6TTGCAACCCCATGCGTTTAGCGCATGATAAG6GTTTAGTCTTACACACAATGGTAG 

I I ■ ■ I I ' I I ■ ■ : i I 1 ■ ■ - I 1 1— r-- 27324 

CAAAAGATTAAGGTCAACTTACAAATAATAATAATCAACGTT6GGGTACGCAAATCGCGTACTATTCGCAAATCAGAATGTGT6TTACCATC 



GCCAGTGATAGTAAAGTGTAAGTAATTTGCTATCATATTAACATGTCTAGAGGAAAGTCAGAACTTTTTCTGTTTGTGTTGTTGGAGTACTT 

I ■ ■ . . i I I ... « ■■■■ I I I ■ ■ ■ ■ I I I 27416 

CG6TCACTATCATTTCACATTCATTAAACGATAGTATAATTGTACAGATCTCCTTTCAGTCTT6AAAAAGACAAACACAACAACCTCATGAA 



AAAGATCGCATAG6C6CGCCAACAATGGAA6AGCCAACAACATATCTAAAAATGTTTTGTCT6GTACTTGTTAATGATATTGTTTTTGATAT 

■ I I .... I .... I I I I I I I 27508 

TTTCTAGCGTATCCGCGCGGTTGTTACCTTCTC6GTTGTTGTATAGATTTTTACAAAACA6ACCATGAACAATTACTATAACAAAAACTATA 



GGATACACAAAAAAAAAAAAAAAA 

'I I i ■ > ' 27532 

CCTATGTGTTTTTTTTTTTTTTTT 



i3UTR*—i""iiH 
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Fig. 4 Alignments 

2i. S' untranslated region (Genomic sequence) aligned with human 
coronavirus 229E 

....I. ...I ....I. ...I ....|..,.| ....I. ...I . I I. ...I 

5 15 25 35 45 55 

EMCRS'UTR AGATAGA GAATTTTCTT ATTTAGACTT TGTGTCTACT 

229E5'UTR ACTTAAGTAC CTTATCTATC TAG AGATAGA AAAGTTGCTT -TTTAGACTT TGTGTCTACT 

I I I 1 I I I I I I I I 

65 75 85 95 105 115 

EMCR5'UTR CCTCTCAACT AAACGAAATT TTT-CTAGTG CTGTCATTTG TTATG — GCA GTCCTAGTGT 

229E5'UTR TTTCTCAACT AAACGAAATT TTTGCTATGG CCGGCATCTT TGATGCTGGA GTCGTAGTGT 

I I I I I I I I I I I I 

125 135 145 155 165 175 

EMCRS'UTR AATTGAAATT TCGTCAAGTT TGTAA-ACTG GTTAGGCAAG TGTTGTATTT TCTGTGTTTA 

229E5'UTR AATTGAAATT TCATTTGGGT TGCAACAGTT TGGAAGCAAG TGCTGTGTGT CCTA-GTCTA 

I I I I I I I I I I I I 

185 195 205 215 225 235 

EMCRS'UTR AGCACTGGTG GTTCTGTC-C ACTAGTGCAC AC-ATTGATA CTTAAGT-GG TGTTCTGTCA 

229E5'UTR AGGGTTTCGT GTTCCGTCAC GAGATTCCAT TCTACAAACG CCTTACTCGA GGTTCCGTCT 

I I I I I I I I I I 

245 255 265 275 285 

EMCRS'UTR CTGCTTATTG TGGAAGCAAG GTTCTGTCGT TGTGGAAACC AATAACTGCT AACC 

229E5'UTR CGTGTTTGTG TGGAAGCAAA GTTCTGTCTT TGTGGAAACC AGTAACTGTT CCTA 



b. Putative Orf la 

— I — I — I — I — I — I — I — I — I — I — I — I 

5 15 25 35 45 55 

EMCR MFYNQVT LAVASDSEIS GFGFAIPSVA VRAYSEAAAQ GFQACRFVAF 

229E MACNRVT LAVASDSEIS ANGCSTIAQA VRRYSEAASN GFRACRFVSL 

PEDV MASNHVT LAFANDAEIS AFGFCTASEA VSYYSEAAAS GFMQCRFVSL 

TGEV MSSKQFK ILVNEDYQVN VPSLPIR-DV LQEIKYCYRN GFEGYVFVPE 

OC43 MSKINKYGLE LHWAPEFPWM FEDAEEKLDN PSSSEVDMIC STTAQKLETD GICPENHVMV 

BOCOV MSKINKYGLE LHWAPEFPWM FEDAEEKLDN PSSSEVDIVC STTAQKLETG GICPENHVMV 

MHV MAKMGKYGLG FKWAPEFPWM LPNASEKLGS PERSEEDGFC PSAAQEPKTK GKTLINHVRV 

AIPV MASSLKQ GVSPKPRDVI LVSKDIPEQL CDALFFYTSH NPKDYADAFA 

SARS COV MESLVLG VNEKTHVQLS LPVLQVRDVL VRGFGDSVBB ALSBAREHLK 

1 I I I I I I I I I I I 

65 75 85 95 105 115 

EMCR GLQDCVTGIN DDD-YVIALT GTNQLCAKIL LFSDRPLNLR GWLIFSNSNY VLQDFDWFG 

22 9E DLQDCIVGIA DDT-YVMGLH GNQTLFCNIM KFSDRPFMLH GWLVFSNSNY LLEEFDWFG 

PEDV DLADTVEGLL PED-YVMVVI GTTKLSAYVD TFGSRPRNIC GWLLFSNCNY FLEELELTFG 

TGEV YCRDLVDCDR KDH-YVIGVL GNGVSDLKPV LLTEPSVMLQ GFIVRANCNG VLEDFDLKIA 

OC43 DCRRLLKQEC CVQSSLIREI VMNASPYDLE VLLQDALQSR EAVLVTTPLG MSLEACYVRG 

BoCoV DCRRLLKQEC CVQSSLIREI VMNTRPYDLE VLLQDALQSR EAVLVTPPLG MSLEACYVRG 

MHV DCSRLPALEC CVQSAIIRDI FVDEDPLNVE ASTMMALQFG SAVLVKPSKR LSIQAWAKLG 

AIPV VRQKFDRSLQ TGKQFKFETV CGLFLLKGVD KITPG VPAKVLKATS KLADLEDIFG 

SARS CoV NGTCGLVELE KGVLPQLEQP YVFIKRSDAL STNHGHKWE LVAEMDGIQY GRSGITLGVL 

I I I I I I I I I I I I 

125 135 145 155 165 175 

EMCR — HGAGSVVF VDKYMCGFDG KPVLPKNMWE FRDYFNDNTD S-IVIGGVTY QLAWDVIRKD 

229E K-RGGGNVTY TDQYLCGADG KPVMSEDLWQ FVDHFGENEE — IIINGHTY VCAWLTKRKP 

PEDV — RRGGNIVP VDQYMCGADG KPVLQESEWE YTDFFADSED GQLNIAGITY VKAWIVERSD 

TGEV — RTGRGAIY VDQYMCGADG KPVIEG D FKDYFGDED IIEFEGEEY HCAWTTVRDE 

OC43 C-NPKGWTMG LFRRRSVCNT GRCTVNKHVA YQLYMIDPAG VCLGAGQ FVGWVIPLAF 

BoCoV C-NPNGWTMG LFRRRSVCNT GRCAVNKHVA YQLYMIDPAG VCFGAGQ FVGWVIPLAF 

MHV V-LPKTPAMG LFKRFCLCNT RECVCDAHVA FQLFTVQPDG VCLGNGR FIGWFVPVTA 

AIPV VSPLARKYRE LLKTACQWSL TVEALDVRAQ TLDEIFDPT- EILWLQVAAK 

SARS COV VPHVGETPIA YRNVLLRKNG NKGAGGHSYG IDLKSYDLGD ELGTDPIEDY EQNWNTKHGS 
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....I. ...I ....I.... I ....I. ...I ....I I I..., I 

185 195 205 215 225 235 

EMCR LSYEQQNVLA lESIHYLG-T TGHTLKSGCK LINAKPPKY- — SSKWLSG EWNAVYKAFG 

229E LDYKRQNNLA lEEIEYVHGD ALHTLRNGSV LEMAKEVKT- — SSKWLSD ALDKLYKVFG 

PEDV VSYASQNLTS IKSITYCS-T YEHTFLDGTA MKVARTPKI KKNWLSE PLATIYREIG 

TGEV KPLNQQTLFT IQEIQYNL-D IPHKLPHCAT RHVAPPVKK NSKIVLSE DYKKLYOIFG 

OC43 MPVQSRKFIV PWVMYLRKRG EKGAYNKDHG RGGFGH VYDFKVED AYDQVHDEPK 

BoCoV MPVQSRKFIA PWVMYLRKCG EKGAYIKDYK RGGFEH VYNFKVED AYDLVHDEPK 

MHV IPAYAKQWLQ PWSILLRKGG NKGSVTSGHF RRAVTMP — VYDFNVED ACEEVHLNPK 

AIPV IHVSSMAMRR LVGEVTAKVM DALGSNLSAL FQIVKQ QIARIFQK ALAIFENVNE 

SARS COV GALRELTREL NGGAVTRYVD NNFCGPDGYP LDCIKDFLAR AGKSMCTLSE QLDYIESKRG 

I.. ..I I I ( I I I I I I I 

245 255 265 275 285 295 

EMCR SPFITNGISL LDIIVKPVFF NAFVKCNCGS ENWSVGAWDG YLSSCCGTPA KKLCWPGNV 

229E SPVMTNGSNI LEAFTKPVFI SALVQCTCGT KSWSVGDWTG FKSSCCNVIS NKLCWPGNV 

PEDV SPFVDNGSDA RSIIRRPVFL HAFVKCKCGS YHWTVGDWTS YVSTCCGFKC KPVLVASCSA 

TGEV SPFMGNGDCL SKCFDTLHFI AATLRCPCGS ESSGVGDWTG FKTACCGLSG KVKGVTLGDI 

OC43 GKFSKKAYAL IRGYRGVKPL LYVDQYGCDY TGSLADGLEA YADKTLQEMK ALFPTWSQEL 

BoCoV GKFSKKAYAL IRGYRGVKPL LYVDQYGCDY TGGLADGLEA YADKTLQEMK ALFPIWSQEL 

MHV GKYSRKAYAL LKGYRGVKSI LFLDQYGCDY TGRLAKGLED YGDCTLEEMK ELFPVWCDSL 

AIPV LPQRIAALKM AFAKCARSIT VVWERTLVV KEFAGTCLAS INGAVAKFFE ELPNGFMGSK 

SARS CoV VYCCRDHEHB lAWFTERSDK SYEHQTPFEI KS — AKKFDT FKGECPKFVF PLNSKVKVIQ 

I I I I I I I I I I I I 

305 315 325 335 345 355 

EMCR VPGDVIITST DAGCGVKYYA GLVVKHITNI TGVSLWRVTA VHSDGMFVAT SSYDALLHRN 

229E KPGDAVITTQ QAGAGIKYFC GMTLKFVANI EGVSVWRVIA LQSVDCFVAS STFVEEEHVN 

PEDV MPGSVVVTRA GAGTGVKYYN NMFLRHVADI DGLAFWRILK VQSKDDLACS GKFLEHHEEG 

TGEV KPGDAVVTSM SAGKGVKFFA NCVLQYAGDV EGVSIWKVIK TFTVDETVCT PGFEGELN — 

OC43 LFDVIVAWHV VRDP RY VMRLQSAATI R SVAYVA NPTEDLCDGS VVIKEPVHVY 

BOCOV PFDVTVAWHV VRDP RY VMRLQSASTI R SVAYVA NPTEDLCDGS VVIKEPVHVY 

MHV DNEVWAWHV DRDP RA VMRLQTLATI R SIGYVG QPTEDLVDGD VWREPAHLL 

AIPV IFTTLAFFKE AAVR VVENIPNAP RGTKGFEWG NAKGTQVWR GMRNDLTLLD 

SARS CoV PRVEKKKTEG FMGRIRSVYP VASPQECKNM HLSTLMKCNH CDEVSWQTCD FLKATCEHCG 

I I I I I I I I I I I I 

365 375 385 395 405 415 

EMCR SLDPFCFDVN TLLSNQLRLA FLGASVTEDV KFAASTGVID ISAGMFGLYD DILTNNKPWF 

22 9E RMDTFCFNVR NSVTDECRLA MLGAEMTSNV RRQVASGVID ISTGWFDVYD DIFAESKPWF 

PEDV FTDPCYFLND SSLATKLKFD ILSGKFSDEV KQAIIAGHW VGSALVDIVD DALG — QPWF 

TGEV — DFIKPESK SLVACSVKRA FITGDIDDAV HDCIITGKLD LSTNLFGNVG LLFKK-TPWF 

OC43 ADDSIILRQY NLVDIMSHFY MEADTWNAF YGVALKDCGF VMQFGYIDCE QDSCDFKGWI 

BoCoV ADDSIILRQH NLVDIMSCFY MEADAWNAF YGVDLKDCGF VMQFGYIDCE QDLCDFKGWV 

MHV AANAIVKRLP RLVETMLYT- — DSSVTEFC YKTKLCDCGF ITQFGYVDCC GDACDFRGWV 

AIPV QKADIPVEPE GWSAILDGHL CYVFRSGDRF YAAPLSGNFA LSDVHCCERV VCLSDGVTPE 

SARS CoV -TENLVIEGP TTCGYLPTNA VVKMPCPACQ DPEIGPEHSV ADYHNHSNIE TRLR— KGGR 

1 I I I I 1 1 I I I 1 I 

425 435 445 455 465 475 

EMCR VRKASGLFDA IWDAFVAAIK LVPTTTGGLV RFVKSIASTV LTVSNGVIIM CADVPDAFQP 

229E VRKAEDIFGP CWSALASALK QLKVTTGELV RFVKSICNSA VAWGGTIQI LASVPEKFLN 

PEDV IRKLGDLASA PWEQLKAWR GLGLLSDEW LFGKRLSCAT LSIVNGVFEF LADVPEKLAA 

TGEV VQKCGALFVD AWKWEELCG SLTLTYKQIY EWASLCTSA FTIVNYKPTF VVPD-NRVKD 

OC43 PGNMIDGFAC TTCGHVYEVG DLIAQSSGVL PVNPVLHTKS AAGYGG -FGCKDSFTL 

BoCoV PGNMIDGFAC TTCGHVYETG DLLAQSSGVL PVNPVLHTKS AAGYGG -FGCKDSFTL 

MHV PGNMMDGFLC PGCSKSYMPW ELEAQSSGVI PKGGVLFTQS TDTVN RESFKL 

AIPV INDGLILAAI YSSFSVSELV TALKKGEPFK FLGHKFVYAK DAAVS FTL 

SARS CoV TRCFGGCVFA YVGCYNKRAY WVPRASADIG SGHTGITGDN VETLN EDLLEILS 

I I I I I I I I I I I I 

485 495 505 515 525 535 

EMCR VYRTFTQAIC AAFDFSLDVF KIG DVKF KRLGDYVLTE NALVRLTTEV VRGVRDARIK 

229E AFDVFVTAIQ TVFDCAVETC TIA GKAF DKVFDYVLLD NALVKLVTTK LKGVRERGLN 

PEDV AVTVFVNFLN EFFESACDCL KVG GKTF NKVGSYVLFD NALVKLVKAK ARGPRQAGIC 

TGEV LVDKCVKVLV KAFDVFTQII TIAGIEAKCF VLGAKYLLFN NALVKLVSVK ILGKKQKGLE 

OC43 YGQTVVYFGG CVYWSPARNI WIP — ILKSS VKSYDSLVYT GVLGCKAIVK ETNLICKALY 

BoCoV YGQTVVYFGG CVYWSPARNI WIP~ILKSS VKSYDGLVYT GVVGCKAIVK ETNLICKALY 

MHV YGHAWPFGS AVYWSPYPGM WLP — VIWSS VKSYADLTYT GWGCKAIVQ ETDAICRSLY 

AIPV AKAATIADVL RLFQSARVIA EDVWS-SFTE KSFEFWKLAY GKVRNLEEFV KTYVCKAQMS 

SARS Gov RERVNINIVG DFHLNEEVAI ILAS-FSAST SAFIDTIKSL DYKSFKTIVE SCGNYKVTKG 

I.... I ..I ....I. ...I ....I I ....I. ...I 

545 555 565 575 585 595 

EMCR KAMFTKVVVG PTTEVKFSVI ELATVNLRLV DCAPWCPKG KIWIAGQAF FYSGGFYRFM 

229E KVKYATVVVG STEEVKSSRV ERSTAVLTIA NNYSKLFDEG YTVVIGDVAY FVSDGYFRLM 

PEDV EVRYTSLVVG STTKVVSKRV ENANVNLVW DEDVTLNTTG RTVWDGLAF FESDGFYRHL 

TGEV CAFFATSLVG ATVNVTPKRT ETATISLNKV DDVVAPG-EG YIVIVGDMAF YKSGEYYFMM 

OC43 LDYVQHKCGN LHQRELLGVS DVWHKQLLLN RGVYKPLLEN IDYFNMRRAK FSLETFTVCA 

BoCoV LDYVQHKCGN LHQRELLGVS DVWHKQLLLN RGVYKPLLEN IDYFNMRRAK FSLETFTVCA 

MHV MDYVQHKCGN LEQRAILGLD DVYHRQLLVN RGDYSLLLEN VDLFVKRRAE FACK-FATCG 

AIPV IVILAAVLGE DIWHLVSQVI YKLGVLFTKV VDFCDKHWKG FCVQLKRAKL IVTETFCVLK 

SARS CoV KPVKGAWNIG QQRSVLTPLC GFPSQAAGVI RSIFARTLDA ANHSIPDLQR AAVTILDGIS 
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EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEOV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS Gov 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BOCOV 

MHV 

AIPV 

SARS CoV 



....I. ...I 

605 
VDSTTVLNDP 
ASPNSVLTTA 
ADADVVIEHP 
SSPNFVLTNN 
DGFMPFLLDD 
DGFMPFLLDD 
DGLVPLLLDG 
GVAQHCFQLL 
EQSLRLVDAM 



....I.. ..I 

665 
DFKTAVFVYT 
KITEFQLDYS 
NYNTPYKTYS 
EFRQQSLCFR 
DSLGAAIHYL 
DSLGAAIHYL 
HDVKVATKYV 
OGDEIWFDAI 
EKLRPIFEWI 



....I.. ..I 

615 

VFTGELFYTI 
VYKPLFAFNV 
VYKSACELKP 
VFKAVKVPSY 
LVPRAYYLAV 
LVPRAYYLAV 
LVPRSYYLIK 
LDAIHSLYKS 
VYTSDLLTNS 



I 



635 



U 

625 

KFSGFKLDGF N H 

NVMGTRPE 

VFECDPIP — D F 

DIVYDVDNDT KSKMIAKLGS 

SGOAFCDY — 

SGQAFCDY 

SGQAFTSM 

FKKCALGR 

VIIMAYVTG 



645 
QFVNASSATD 
KFPTTVTCEN 
PLPVAASVAE 
SFEYDGDIDA 
ADKLCHAVVS 
AGKICHAWS 
MVNFSHEVTD 

IHGDLLF 

— GLVQQTSQ 



1 I 

655 
AZIAVELLLS 
LESAVLFVND 
LCVQTDLLLK 
AIVKVNELLI 
KSKELLDVSL 
KSKELLDVSV 
MCMDMALLFM 
WKGGVHKIVQ 
WLSNLLGTTV 



I I 

675 
CVVDGCSVIV 
IDVIDNEIIV 
CVVRGDKCCI 
AFKDDKSIFV 
NSKIVDLAQH 
NSKIVDLAQH 
KKVTGKLAVR 
DSVDVEDLGV 
EAKLSAGVEF 



I I 

725 
LQSNNPQCAI 
ISVLDITDAA 
HEQQDLQGFL 
NELEDIKETN 
HGAYIWESD 
HGAYIWESD 
NGLFAVANGG 
RFKKDENIYY 
CFIDWNKAL 



I I 

735 
VQASESK— V 
VKAAESK~A 
TTCCTMSGFE 

IQAIKN 

lYFVKN 

lYFGKN 

ITFLSD 

TPMSQLG 

BMCIDQ 



I I 

685 
RRDAT-FATH 
KPNIS-LCVP 
TCTLQ-FKAP 
EAYFKKYKMP 

FSDFG 

FSDFG 

FKALG 

VQEKS ID 

LKDAW 



I I 

695 
VCFKDCYSIW 
LYVRDYVDKW 
SYVEDAVN-F 
ACLAKHIG-L 
TSFVSKIVHF 
TSFVSKIVHF 
VAWRKITEW 
FEVCDDVTLP 
EILKFLITGV 



705 
EQFCIDNCGE 
DDFCRQYSNE 
VDLCTKNIGT 
WNIIKKDSCK 
FKTFTTSTAL 
FKTFTTSTAL 
FDLAVDTAAS 
ENQFGHMVQI 
FDIVKGQIQV 



715 
PWFLTDYNAI 
SWFEDDYRAF 
AGFHEFYITA 
RGFLNLFNHL 
AFAWVLFHVL 
AFAWVLFHVL 
AAGWLCYQLV 
EDDGKNYMFF 
ASDNIKDCVK 



I I 

745 
LLERFLPKCP 
FVDTIVPPCP 
CFMPTIPQCP 

ILCP 

-IPRYASAVA 
-IPRYASAVA 
-VPELVKNFV 

AINVVCK 

VTIAG 



I I 

755 
EILLSIDDGH 
SILKVIDGGK 
AVLEEIDGGS 
DPLLDLDYGA 
QAFQSVAKW 
QAFRSGAKVG 
DKFKVFFKVL 
AGGKTVTFG- 
AKLRSLNLGE 



765 
LWNLFVEKFN 
IWNGVIKNVN 
IWRSFITGLN 
IWYNCMPGCS 
LDSLRVTFID 
LDSLRVTFID 
IDSMSVSVLS 
— ETTVQEIP 
VFIAQSKGLY 



I I 

775 
FVTDWLKTLK 
SVRDWLKSLK 
TMWDFCKRLK 
DP-SVLGSVQ 
GLSCFKIGRR 
GLSCFKIGRR 
GLTVVKTASN 
PPDVVPIKVS 
RQCIRGKEQL 



....I. ...I 

785 
LTLTSNGLLG 
LNLTQQGLLG 
VSFGLDGIVV 
LLIGNG — VK 
RICLSGRKIY 
RICLSGSKIY 
RVCLAGCKVY 
lECCGEPWNT 
QLLMPLKAPK 



I I 

795 
NCAKRFRRVL 
TCAKRFKRWL 
TVARKFKRLG 
VVCDGCKGFA 
EVERGLLHSS 
EVERGLLHSS 
EVVQKRLSAY 
XFKKAYKEPI 
EVTFLEGDSH 



....I.... I 

805 
VKLLDVYNGF 
GILLEAYNAF 
ALLAEMYNTY 
NQLSKGYNKL 
QLPLDVYDLT 
QLPLDVYDLT 
VMPVGCNEAT 
EVDTDLTVEQ 
DTVLTSEEW 



I I 

815 
LETVCSWHT 
LDTWSTVKI 
LSTWENLVL 
CNAARNDIEI 
MPSQVQKAKQ 
MPSQVQKTKQ 

C 

LLSVIYEKMC 
LKNGELEALE 



825 
AGVCIKYYAV 
GGLTFKTYAF 
AGVSFKYYAT 
GGIPFSTFKT 
KPIYLKGSGS 
KGIYLKGSGS 

LVGEIE 

DDLKLFPEAP 
TFVDSFTNGA 



I I 

835 
NVP-YWISG 
DKP-YIVIRD 
SVP-KIVLGG 
PTNTFIEMTD 
DFSLADSVVE 
DFSLADSVVE 
PAVVEDDWD 
EPPPFENVAL 
IVGTPVCVNG 



I I 

845 
FVSRVIRRER 
IVCKVENKTE 
CFHSVKSVFA 
AIYSVIEQGK 
VVTTSLTPCG 
WTTSLTPCG 
VVKAPLTYQG 
VDKNGKDLDC 
LMLLEIKDKE 



....|..,.| ..I 

855 865 
CD — VTFPCV SCVTFFYEFL 
AEWIELFPHN DRIKSFSTFE 
SV— FQIPVQ AGIEKFKVFL 



AL- 

YS— 
YS— 
CC— 
IKS- 
QY— 



S- 



-FR 



-EPP KVADKICIVD 
-EPP KVADKICIVD 
-KPP TSFEKICWD 

CHLI 

CALS 



875 
DTCFGVSK — 
SAYMPIAD — 

NCVHPW 

DADVPVVDNG 
NVYMAKAGDK 
NVYMAKAGDK 
KLYMAKCGDQ 

YRDYESD 

PGLLATN 



885 
— PNAIDVEH 
— PTHFDIEE 
— PRVIETSF 
TISTADWSEP 
YYPVVVD-DH 
YYPVVVD-GH 
FYPVVVDNDT 

DD 

NV 



I I 

895 
LELKETVFVE 
VELLDAEFVE 
VELEETTFKP 
ILLEPAEYVK 
VGLLDQAWRV 
VGLLDQAWRV 
IGVLDQCWRF 
lEEEDAEECD 
FRLKGGAPIK 



I I 

905 
PKDGGQFFVS 
PGCGGILAVI 
PALNGGIAIV 
PKNNGNVIVI 
PCAGRRVTFK 
PCAGRCVTFK 
PCAGKKVEFN 
TDSGEAEECD 
GVTFGEDTVW 



1 I 

915 
DDYLWYVV-D 
DEHVFYKK-D 
DGFAFYYD-G 
AGYTFYKDED 
EQPTVKEIIS 
EQPTVNEIAS 
DKPKVKEIPS 
TNSECEEEDE 
EVQGYKNVRI 



925 



I 



-lY 
-VY 
-LY 
-HF 



....I.. ..I 

965 
IVHDVEPTHK 
EVKDIEPVYR 
SVKTIDPVYK 
DVQEIAPVTR 
VIDAIEEKLS 
VIDAIEEKLS 
VLDAVESTLS 
HKDALDVVNL 
VAEAVVKTLQ 



I I 

975 
VKLIFEFEDD 
VKLCFEFEDE 
VSLEFEFESE 
VKLEFEFDNE 
PCKELEGVGA 
PCKELEGVGA 
PCKEHDVIGT 
PSGEETFWN 
PVSDLLTN — 



MPKIIKVFYE 
TPKTIKVFYE 
T-RKIKINFA 

D TK 

T FE 



935 
YPASCNGVLP 
YPSNGTNILP 
YPTDGNSVVP 
YPYGFGKIVQ 
LDNDFNTILN 
LDKDFNTILN 
LDATFDSVLS 
VLALIQDPAS 
LDERVDKVLN 



945 
VAFTKLAGGK 
VAFTKAAGGK 
ICFKKKGGGD 
RMYNKMGGGD 
TACGVFEVDD 
TACGEFEVDD 
KACSEFEVDK 
IKYPLPLDED 
EKCSVYTVES 



I I 

955 

ISFSDDV 

VSFSDDV 

VKFSDEV 

KT-VSFSEEV 
TVDMEEFYAV 
TVDMEEFYAV 
DVTLDELLDV 
YS-VYNGCIV 
GTEVTEFACV 



I I 

985 
-VVTSLCKKS 
-KLVDVCEKA 
-TIMAVLNKA 
-IVTGVLERA 
-KVSAFLQKL 
-KVSAFLQKL 
-KVCALLNRL 
NCFEGAVKPL 
— MGIDLDEW 



995 
FGKSIIYTG- 
IGKKIKHEG- 
VGNRIKVTG- 
IGTRYKFTGT 
EDNPLFLFD- 
EDNSLFLFD- 
AEDYVYLFD- 
PQKVVDVLG- 
SVATFYLFD- 



1005 
DWEGLHEVLT 
DWDSFCKTIQ 
GWDDVVEYIN 
TWEEFEESIS 
— EAGEEVLA 
— EAGEEVLA 
—EGGEEVIA 
— DWGEAVDA 
— DAGEENFS 



I I 

1015 

SAMNVIG 

SALSVVS 

VAIEVLK 

EELDAIFDTL 
PKLYCAFTAP 
PKLYCAFTAP 
PKMYCSFSAP 

QEQLCQQ 

SRMYCSFYPP 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



43/87 



PCT/NL2004/00080S 



I I 1 I I 1 I I I I 

1025 1035 1045 1055 1065 1075 

EMCR — QHIKLPQF YIYDEEGGYD VSKP — VMIS QWPISDDSDG CWEASTDFH Q — LESVREE 

229E — CYVNLPTY YIYDEEGGND LSLP — VMIS EWPLSVQQAQ QEATLPDIAE D — VVDQVEE 

PEDV — DHVEVPKY YIYDEEGGTD PNLP — VMVS QWPLNDDTIS QDLLDVEVVT DAPIDSEGDE 

TGEV ANQGVELEGY FIYDTCGGFD IKNPDGIMIS QYDINITADE KSEVSASSEE EE-VESVEED 

OC43 EDDDFLEESD VEEDDVEGEE TDLTVTSAGQ PCVASEQEES SEVLEDTLDD GPSVETSDSQ 

BoCoV EDDDFLEESG VEEDDVEGEE TDLTVTSAGE PCVASEQEES SEILEDTLDD GPCVETSDSQ 

MHV DDEDCVAADV VDADENQGDD ADDSAALVTD TQEEDGVAKG QVGVAESDAR LDQVEAFDIE 

AIPV — EPLQHTFE EPVENSTGSS KTMTEQVWE DQELPVVEQD QDWVYTPTD LEVAKETAEE 

SARS COV DEEEEDDAEC EEEEIDETCE HBYGTEDDYQ GLPLEFGASA ETVRVEEEEE EDWLDDTTEQ 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS COV 



I 



I 



1095 



1105 



I 



I 



I 



1 . 



1135 



I 



1115 1125 

HE QPFGEVEHAL SIRQ 

IFD lETVDVKHDV S 

-D VANSEPGDDG LPVAPETNVE SEVBEVAATL SFIKDTPSTV 



....I.... I 
1085 

VD 

VNS 

VDSSAPERVA 

PENEIVEASE GAEGTSSQEE VETVEVADIT STEBDVDIVE VSAKDDPWAA AVDVQEAEQF 

VEEDVEMS— DFVDL ESVIQD YENVCFEF YTT 

VEEDVQMS — DFGDL ESVIQD — YENVCFEF YTT 

KVEDPILN — ELSAE LNAPADK TYEDVLAFD AIYSEALSAF YAVP 

VD 

SEIEPEP E PTPEEPVNQF TG 



I I I I I I I I I I I I 

1145 1155 1165 1175 1185 1195 

EMCR PFSFSFR DELGVRVLDQ SDNNCWISTT LIQLQLTKLL DDSIEMQLFK VGKVDSIVQK 

229E PFEMPFE ELNGLKILKQ LDNNCWVNSV MLQIQLTGIL DGDYAMQFFK MGRVAKMIER 

PEDV TKDPFAFDFV SYGGLKVLRQ SHNNCWVTST LVQLQLLGIV DDP-AMELFS AGRVGPMVRK 

TGEV NPSLPPFKTT NLNGKIILKQ GDNNCWINAC CYQLQAFDFF NNE-AWEKFK KGDVMDFVNL 

OC43 EPEFV KVLGLYVPKA TRNNCWLRSV LAVMQKLPCQ FKD— KNLQD LWVLYKQQYS 

BoCoV EPEFV KVLDLYVPKA TRNNCWLRSV LAVMQKLPCQ FKD~KNLQD LWVLYKQQYS 

MHV GDETHF KVCGFYSPAI ERTNCWLRST LIVMQSLPLE FKD— LEMQK LWLSYKSSYN 

AIPV EFILIFAVPK EEVVSQKDGA QIKQEPIQW KPQ— REKKA KKFKVKPATC 

SARS COV YLKLTD NVAIKCVDIV KEAQSANPMV IVNAANIHLK HGGGVAGALN KATNGAMQKE 



I I I I I I I I 1 I 1 I 

1205 1215 1225 1235 1245 1255 

EMCR CYELSHLISG SLGDSGKLLS ELLKDKYTCS ITFEMSCDCG KKFDEQVGCL FWIMPYTKLF 

22 9E CYTAEQCIRG AMGDVGLCMY RLLKDLHTGF MVMDYKCSCT SGRLEESGAV LFCTPTKKAF 

PEDV CYESQKAILG SLGDVSACLE SLTKDLHTLK ITCSWCGCG TGERIYEGCA FRMTPTLEPF 

TGEV CYAATTLARG HSGDAEYLLE LMLNDYSTAK IVLAAKCGCG EKEIVLERAV FKLTPLKESF 

OC43 QLFVDTLVNK IPANIVLPQG GYVADFAYWF LTLCDWQCVA YWKCIKCDLA LKLKGLDAMF 

BoCoV QLFVDTLVNK IPANIVVPQG GYVADFAYWF LTLCDWQCVA YWKCIKCDLA LKLKGLDAMF 

MHV KEFVDKLVKS VPKSIILPQG GYVADFAYFF LSQCSFKAYA NWRCLKCDMD LKLQGLDAMF 

AIPV EKPKFLEYKT CVGDLTWIA KALDEFKEFC IVNAANEHMT HGSGVAKAIA DFCGLDFVEY 

SARS CoV SDDYIKLNGP LTVGGSCLLS GHNLAKKCLH VVGPNLNAGE DIQLLKAAYE NFNSQDILLA 



I 1 I I I I I I I I 1 1 

1265 1275 1285 1295 1305 1315 

EMCR QKGECCICHK MQTYKLVSMK GTGVFVQD— PAPIDIDAFP VRPICSSVYL 6VKGSGHYQT 

229E PYGTCLNCNA PRMCTIRQLQ GTIIFVQQK- PEPVNPVSFV VKPVCSSIFR GAVSCGHYQT 

PEDV PYGACAQCAQ VLMHTFKSIV GTGIFCRD— TTALSLDSLV VKPLCAAAFI GK-DSGHYVT 

TGEV NYGVCGDCMQ VNTCRFLSVE GSGVFVHDIL SKQTPEAMFV VKPVMHAVYT GTTQNGHYMV 

OC43 FYGDVVSHIC KCGESMVLID VDVPFTAHFA LKDKLFCAFI TKRIVYKAAC VVDVNDSHSM 

BoCoV FYGDWSHVC KCGESMVLID VDVPFTAHFA LKDKLFCAFI TKRSVYKAAC VVDVNDSHSM 

MHV FYGDWSHVC KCGTGMTLLS ADIPYTLHFG LRDDKFCAFY TPRKVFRAAC VVDVNDCHSM 

AIPV CEDYVKKHGP QQRLVTPSFV KGIQCVNNVV GPRHGDNNLH EKLVAAYKNV LVDGVVNYW 

SARS CoV PLLSAGIFGA KPLQSLQVCV QTVRTQVYIA VNDKALYEQV VMDYLDNLKP RVEAPKQEEP 



I I I I I I I 1 I I I I 

1325 1335 1345 1355 1365 1375 

EMCR NLYSFDKAID GFGVFDIK — NSSV NTVCFVDVDF HS-VEIEAGE 

229E NIYSQNLCVD GFGVNKIQP- WTNDAL NTICIKDADY NAKVEISVTP 

PEDV NFYDAAMAID GYGRHQIK — YDTL NTICVKDVNW TAPLVPAVDS 

TGEV DDIEHGYCVD GMGIKPLKKR CYTSTLFINA NVMTEIAEKPK QEFKVEKVEQ QPIVEENKSS 

OC43 AWDG-KQID DHRITSIT — SDK FDFIIGHGMS FSMTTFEIAQ 

BoCoV AWDG-KQID DHRITSIT SDK FDFIIGHGTS FSMTTFEIAQ 

MHV AWDG-KQID GKVVTKFN GDK YDFMVGHGMA FSMSAFEIAQ 

AIPV PVLSLGIFGV DFKMSIDAMR EA FEGCTIRVLL FSLSQEHIDY 

SARS CoV PNTEDSKTEE KSWQKPVDV KP KIKACIDE VTTTLEETKF LTNKLLLFAD 



I I I I I I I I I 1 I I 

1385 1395 1405 1415 1425 1435 

EMCR VK PFAVYKNVKF YLGDISHLVN CVSFDFVVNA ANENLMHGGG 

229E IKNTVDTTPK EEFVVKEKLN AFLVHDNVAF YQGDVDTWN GVDFDFIVNA ANENLAHGGG 

PEDV VVEP VVK PFYSYKNVDF YQGDFSDLVK -LPCDFVVNA ANEKLSHGGG 

TGEV lEKEEIQSPK ND DLIL PFYKAGKLSF YQGALDVLIN FLEPDVIVNA ANGDLKHMGG 

OC43 LYG SCITPNVCF VKGDIIKVSK LVKAEVVVNP ANGHMAHGGG 

BoCoV LYG SCITPNVCF VKGDIIKVSK RVKAEVVVNP ANGHMAHGGG 

MHV LYG SCITPNVCF VKGDVIKVLR RVGAEVIVNP ANGRMAHGAG 

AIPV FD VTC KQKTIYLTED GVKYRSIVLK PGDSLGQFGQ 

SARS CoV INGKLYHD SQ NMLRGEDMSF LEKDAPYMVG DVITSGDITC WIPSKKAGG 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



44/87 



PCT/NL2004/000805 



I j 1 I I I I ! I 1 I I 

1445 1455 1465 1475 1485 1495 

EMCR VARAIDILTE GQLQSLSKDY ISSNGPLKVG AGVMLE— CE KFNVFNWGP RTG KHEH 

229E LAKALDVYTK GKLQRLSKEH IGLAGKVKVG TGVMVE — CD SLRIFNWGP RKG KHER 

PEDV lAKAIDVYTK GMLQKCSNDY lECAHGPIKVG RGVMLE— AL GLKVFNVVGP RKG KHAP 

TGEV VARAIDVFTG GKLTERSKDY LKKNKSIAPG NAVFFENVIE HLSVLNAVGP RNGD— SRVE 

OC43 VAKAIAVAAG QQFVKETTDM VKSKGVCATG DCYVSTGGKL CKTVLNWGP DARTQGKQSY 

BoCoV VAKAIAVAAG QQFVKETTDM VKSKGVCATG DCYVSTGGKL CKTVLNWGP DARTQGKQSY 

MHV VAGAIAKAAG KSFIKETADM VKNQGVCQVG ECYESTGGNL CKTVLNIVGP DARGHGKQCY 

AIPV VYAKNKIVFT ADDVEDKEIL YVPTTDKSIL EYYGLD A QKYVIYLQTL AQKWNVQYRD 

SARS CoV TTEMLSRALK KVPVDEYITT YPGQGCAGYT LEEAKTALKK CKSAFYVLPS EAPNAKEEIL 

I I I I I I I I I I I I 

1505 1515 1525 1535 1545 1555 

EMCR SLLVEAYNSI LF ENGIP LMPLLSCGIF GVRIENSLKA LFSCDINKPL QVFVYSSNEE 

22 9E DLLIKAYNTI NN EQGTP LTPILSCGIF GIKLETSLEV LLDVCNTKEV KVFVYTDTEV 

PEDV ELLVKAYKSV FA NSGVA LTPLISVGIF SVPLEESLSA FLACVGDRHC KCFCYGDKER 

TGEV AKLCNVYKAI AK CEGKI LTPLISVGIF NVRLETSLQC LLKTVNDRGL NVFVYTDQER 

004 3 VLLERVYKHL N NYDCV VTTLISAGIF SVPSDVSLTY LLGTAKKQVV LVSNNQEDFD 

BoCoV ALLERVYKHL N KYDCV VTTLISAGIF SVPSDVSLTY LLGTAKKQVV LVSNNQEDFD 

MHV SFLERAYQHI N KCDDV VTTLISAGIF SVPTDVSLTY LIGVVTKNVI LVSNNKDDFD 

AIPV NFLILEWRDG N CWISS AIVLLQAAKI RFKGFLTEAW AKLLGGDPTD FVAWCYASCT 

SARS CoV GTVSWNLREM LAHAEETRKL MPICMDVRAI MATIQRKYKG IKIQEGIVDY GVRFFFYTSK 

I ,...1 I I I I. ...I 

1565 1575 1585 1595 1605 1615 

EMCR QAVLKFLDGL DLTPVID DVDW -KPFRVEGNF SFFDCG V 

22 9E CKVKDFVSGL VNVQKVE QPKIE PKPVSVIKVA PKPYRVDGKF SYFTED L 

PEDV EAIIKYMDGL VDAIFKEALV DTTPVQEDVQ QVSQKPVLPN FEPFRIEGAH AFYECNPEGL 

TGEV QTIENFFS — 

OC43 LISKCQITAV EG T 

BoCoV LISKCQITAV EG T 

MHV VIEKCQVTSI AG T 

AIPV AKVGDFSDAN 

SARS CoV EPVASIITKL N S 

I I f 1 I I I I I I I I 

1625 1635 1645 1655 1665 1675 

EMCR NALDGD-IYL LFTNSILMLD KQGQLLDTKL NGILQQAVLD YLATVKTVPA GNLVKLWE- 

229E LCVADDKPIV LFTDSMLTLD DRGLALDNAL SGVLSAAIKD CVDINKAIPS GNLIKFDIG- 

PEDV MSLGAD-KLV LFTNSNLDFC SVGKCLNDVT SGALLEAINV FKKSNKTVPA GNCVTLDCAN 

TGEV 

OC43 KKLAARLSFN VGRSIVYETD ANKLILIN DVAFVSTFN VLQDVLSLRH DIALDDDART 

BoCoV KKLAERLSFN VGRSIVYETD ANKLILSN DVAFVSTFN VLQDVLSLRH DIALDDDART 

MHV KALSLQLAKN LCRDVKFETN ACDSLFS -DSCFVSSYD VLQEVELLRH DIQLDDDARV 

AIPV WLLA NLAEHFDADY 

SARS CoV LNEPLVTMPI GYVTHGFNLE EAARCMR -SLKAPAVVS VSSPDAVTTY NGYLTSSSKT 

I I I I I I I I 1 I I i 

1685 1695 1705 1715 1725 1735 

EMCR SCTIYMCVVP SI-NDLSFDK NLGRCVRKLN RLKTCVIANV PAIDVLKKLL SSLTLTVKFV 

229E SVWYMCWP SE-KDKHLDN NVQRCTRKLN RLMCDIVCTI PADYILPLVL SSLTCNVSFV 

PEDV MISITMWLP FD-GDANYDK NYARAVVKVS KLKGKLVLAV DDATLYSKLS — HLSVLGFV 

TGEV CSIP 

OC43 FVQSNVDVVP EG-WRVVNKF YQINGVRTVK YFECTGGIDI CSQDKVFGYV QQGIFNKATV 

BoCoV FVQSNVDVVP EG-WRVVNKF YQINGVRPVK YFECPGGIDI CSQDKVFGYV QQGSFNKATV 

MHV FVQAHMDNLP AD-WRLVNKF DSVDGVRTVK YFECPGEIFV SSQGKKFGYV QNGSFKVASV 

AIPV TNAFLKKRVS CN CG 

SARS CoV SEEHFVETVS LAGSYRDWSY SGQRTELGVE FLKRGDKIVY HTLESPVEFH LDG — EVLSL 

I 1 I I I I I I I I I I 

1745 1755 1765 1775 1785 1795 

EMCR VESNVMDVND CFKNDNWLK ITEDGINVKD VVVESSKSLG KQLG-VVSDG VDSFEGVLP- 

229E GELKAAEA— KVITIK VTEDGVNVHD VTVTTDKSFE QQVG-VIADK DKDLSGAVPS 

PEDV STPDDVER — FYANKSWIK VTEDTRSVKA VKVESTATYG QQIG-PCLVN DTVVTDNKP- 

TGEV VN VTEDNVNHER VSVSFDKTYG EQLKGTWIK DKDVTNQLPS 

OC43 AQIKALFLD- KVDIL LTVDGVNFTN RFVPVGESFG KSLG-NVFCD GVNVTKHKCD 

BoCoV AQIKALFLD- KVDIL LTVDGVNFTN RFVPVGESFG KSLG-NVFCD GVNVTKHKCD 

MHV SQIRALLAN KVDVL CTVDGVNFRS CCVAEGEVFG KTLG-SVFCD GINVTKVRCS 

AIPV IKSYB LRGLEACIQP VRATNLLHFK TQYSNCPTCG ANNTDEVIEA 

SARS CoV DKLKSLLSLR — EVKTIKVF TTVDNTNLHT QLVDMSMTYG QQPG-PTYLD GADVTKIKPH 

....I I I., ..I I I I I I I I I 

1805 1815 1825 1835 1845 1855 

EMCR -INTDTVLSV APEVDWVAFY GFEKAALFAS LDVKPYG YPNDF VGGFRVLGTT 

229E DLNTSELLTK AIDVDWVEFY GFKDAVTFAT VDHSAFA YESAV VNGIRVLKTS 

PEDV -VVADVVAKV VPNANWDSHY GFDKAGEFHM LDHTGFT FPSEV VNGRRVIKTT 

TGEV AFDVGQKVIK AIDIDWQAHY GFRDAAAFSA SSHDAYK FEVVT HSNFIVHKQT 

OC43 INYKGKVFFQ FDNLSSEDLK AVRSSFNFDQ KELLAYYNML VNCFKWQVW NGKYFTFKQA 

BoCoV INYKGKVFFQ FDNLSSEDLK AVRSSFNFDQ KELLAYYNML VNCSKWQVVF NGKYFTFKQA 

MHV AIHKGKVFFQ YSGLSAADLV AVTDAFGFDE PQLLKYYNML G-MCKWPVVV CGNYFAFKQS 

AIPV SLPYLLLFAT DGPATVDCDE DAVGTVVFVG STNSGHCY — TQA AGQAFDNLAK 

SARS CoV VNHEGKTFFV LPSDDTLRSE AFBYYHTLDE SFLGRYMSAL NHTKKWKFPQ VGGLTSIKWA 

SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

45/87 



I 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS COV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS COV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



1865 
DNNCWVNATC 
DNNCWVNAVC 
DNNCWVNVTC 
DNNCWINAIC 
NNNCFVNVSC 
NNNCFVNVSC 
NNNCYINVAC 
DRKFGKKSPY 
DNNCYLSSVL 



I I 

1925 
ALSKLSEYLI 
TLTKLSKYLA 
ALNMLSKYIV 
MLHKLGDLMD 
FLRVVFSQVD 
FLRVVFSQVD 
FMRWLREAD 
FEQWYDSNIY 
TMTHLLQHAN 



1875 
IILQYLKPTF 
lALQYSKPHF 
LQLQFARFRF 
LALQRLKPQW 
LMLQSLHLTF 
LMLQSLNLKF 
LMLQHLSLKF 
ITAMYTRFAF 
LALQQLEVKF 

I I 



I 



1885 
KSKGLNVLWN 
ISQGLDAAWN 
KSAGLQAMWE 
KFPGVRGLWN 
KIVQWQEAWL 
KIVQWQEAWL 
HKWQWQEAWN 
KN-ETSLPVA 
NAPALQEAYY 



1895 
KFVTGDVGPF 
KFVLGDVEIF 
SYCTGDVAMF 
EFLERKTQGF 
EFRSGRPARF 
EFRSGRPARF 
EFRSGKPLRF 
KQSKGKSKSV 
RARAGDAANF 



I I 

1905 
VSFIYFITMS 
VAFVYYVARL 
VHWLYWLTGV 
VHMLYHISGV 
VALVLAKGGF 
VSLVLAK6GF 
VSLVLAKGSF 
KEDVSNLATS 
CALILAYSNK 



I I 

1915 
SKGQKGDAEE 
MKGDKGDAED 
DKGQPSDSEN 
KKGEPGDAEL 
RFGDPADSRD 
KFGDPADSRD 
KFNEPSDSTD 
SKASFDNLTD 
TVGELGDVRE 



1935 



..I I 

1945 
-DSIVTLE 
-EAQVQLE 
-AGSVTIE 
-DCEIIVT 



S 

N 

p 

N 

LTGAICDF-E lACKCGVKQE 
LTGAICDF-E lACKCGVKQE 
LSGATCDF-E FVCKCGVKQE 

ES LKVQE 

LESAKRVLNV VCKHCGQKTT 



....I I ) I I I 

1955 1965 1975 

QYSTCDIC — 

HYSSCVECDA K 

RVTHDGCC 

HTTACDKC— 

QRTGLDAVMH FGTLSREDLE IGYTVDCSCG 
QRTGVDAVMH FGTLSREDLE IGYTVDCSCG 
QRKGVDAVMH FGTLDKGDLA KGYTIACTCG 

SPDNFDKY — 

TLTGVEAVMY MGTLSYDNLK TGVSIPCVCG 



..I I 

1985 



I 



1995 



2005 

KSTVVEVKSA 

F KNSVASINSA 

-CSKRVVTAP 

-AKVEKFVGP 

KKLIHCVRFD VP — FLICSN TPASVKLPKG 
KKLIHCVRFD VP — FLICSN TPASVKLPKG 
NKLVHCTQLN VP — FLICSN KPEGKKLPDD 

VSFTTKEDS 

RDATQYLVQQ ESSFVMMSAP PAEYKLQQGT 



I I 

2045 
KLRSRVKFVN 
KYYSRVRSVR 
NYIGKWWK 
SVNVKVTQIK 
SNVKKVTDVT 
SNVKKVTDVT 
CNVSKVSEAK 
FIYKLTPDTD 
AHLTKMSEYK 



I 



I 



1 . 



2055 



2065 



. 1 



I I 

2015 
WCASVLKDG 
IVCASVKRDG 
WNASVLKLG 
WAAPLAIHG 
VGSANIFIGD 
VGSANIFKGD 
VVAANIFTGG 
KLPLTLKVRG 
FLCANEYTGN 

I 



1. 



2025 



I I I 

2035 

CDVGFCPHRH 

VQVGYCVHGI 

VEDGLCPHGL 

TDE-TCVHGV 

KVG-HYVHVK CEQSYQLYDA 
KVG-HYVHVK CEQSYQLYDA 
SLG-HYTHVK CKPKYQLYDA 

IK SVVDFRSKDG 

YQCGHYTHIT AKETLYRIDG 



2075 



.,1 I 

2085 



G 

G 

G 

GKLSDCLYLK NLKQTFKSVL TTYYLDDVKK lEYKPDLSQY 
GNLSDCLYLK NLKQTFKSVL TTYYLDDVKK lEYKPDLSQY 
GNFTDCLYLK NLKQTFSSKL TTFYLDDVKC VEYNPDLSQY 

EN S 

GPVTDVFYKE TSYTTTIKPV SYKLDGVTYT EIEPKLDGYY 



I I 

2095 
-RVVITNVGE 
-RAIIVSVEQ 
-TTIWNVGK 
-TVAITSLIG 
YCDGGKYYTQ 
YCDGGKYYTQ 
YCESGKYYTK 
-KAPVYYPVL 
KKDNAYYTEQ 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



I I 

2105 
PIISQPSKLL 
LEPCAQSRLL 
PVVAPSHLFL 

PUG EVL 

RIIKAQFKTF 
RIIKAQFKTF 
PIIKAQFRTF 
DAISLKAIWV 
PIDLVPTQPL 



I I 

2115 
NGIA — YTTF 
SGVA — YTAF 
KGVS — YTTF 
EATG — YICY 
EKVDGVYTNF 
EKVDGVYTNF 
EKVEGVYTNF 
EGNANFWGH 
PNAS — FDNF 



2125 



I 



S 

S 

LDN- 
S 



2135 

GSFD 

GPVD 

GNGV 

GSNR 



KLIGHTVCDS LNA-KLGFDS 
KLIGHTVCDI LNA-KLGFDS 
KLVGHSIAEK FNA-KLGFDC 

PN YYSKS 

KLTCSNTKFA DDLNQMTGFT 



I I 

2145 
NGHYWYDAA 
KGHYTVYDTA 
VGHYTVFDHG 
NGHYTYYDNR 
SKEFVEYKIT 
SKEFVEYKVT 
NSPFTEYKIT 
LHIPTFWENA 
KPASRELSVT 



I I 

2155 
NNAVYDGARL 
KKSMYDGDRF 
TGMVHDGDAF 
NGLWDAEKA 
EWPTATGDW 
EWPTATGDVV 
EWPTATGDW 
ENFVKMGDKI 
FFPDLNGDW 



I I I I |... 

2165 2175 2185 

FASD 

VKHD 

VPGD 

YHFN 

LATDDLYVKR YERGCITFGK PVIWLS— 
LATDDLYVKR YERGCITFGK PVIWLS-- 
LASDDLYVSR YSGGCVTFGK PVIWLG— 
GGVT 



..I I I I 

2195 2205 

LSTLAVTA 

LSLLSVTS 

— LNVSPVTN 

— RDLLQVTT 

— HEKASL NSLTYFNRPS 
--HEQASL NSLTYFNRPL 
— HEEASL KSLTYFNRPS 
MGLWRAEH 



AIDYRHYSAS FKKGAKLLHK PIVWHINQAT TKTTFKPNTW CLRCLWSTKP 



I I 

2215 
IWVGGCVTS 
VVMVGGYVA- 
VWSEQTAVV 
AIASNFVVKK 
LVDDNKFDVL 
LVDENKFDVL 
VVCENKFNVL 
LNKPNLERIF 
VDTSNSFEVL 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



2225 



2235 



2245 



2255 



I 



1 



2265 



2275 



PQAEER P — 

KVDOVD DGGDSSB SGAKE T 

KVDDVD DGGDISE SDAKE P 

PVDVSEPTDK GPVPAAVLVT GALSGAATAP GTAKEQKVCA SDSWDQWS GFLSDLSGAT 

N 

AVEDTQG 



MDNLACESQQ PTSEEVVEN- 
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I f I I ....1 I I I I 1 I I 

2285 2295 2305 2315 2325 2335 

EMCR NVPP IVSEKISVMD KLDTG 

22 9E PV NTVKPKPVIN QLDEK 

PEDV IKDP VKKAELDATK LLDTMNY 

TGEV KN CAFNKVAASP KIVQEQKLLA lESGANY 

OC43 KEINIIKLSG VKKPFKVEDS VIVNDDTSET KYVKSLSIVD VYDMWLTGCK YWRTANALS 

BOCOV KEINIIKLSG VKKPFKVEDS VIVNDDTSEI KYVKSLSIVD VYDMWLTGCR CWRTANALS 

MHV VDVKEVKLNG VKKPIKVEDS VVVNDPTSET KWKSLSIVD VYDMFLTGCR YVVWMANELS 

AIPV lAKK AIVGSSWTT QCGKLIG 

SAEIS CoV PTIQKEVIEC DVKTTEWGN VILKPSDEGV KVTQELGHED LMAAYVENTS ITIKKPNELS 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS Gov 



2345 

AQK 

AQK 

ASER 

ALTE 

RAVNVPTIRK 
RAVNVPTIRK 
RLVNSPTVRE 

K 

LALG LKT 



I I 

2355 
FFQFGDFVMN 
FFDFGDFLIH 
FFSFGDFMSR 
FGRYADMFFM 
FIKFGMTLVS 
FIKFGMTLVS 
YVKWGMTKIV 
AATFIADKVG 
lATHGIAAIN 



2365 



2375 



2385 



N 

N 

N 

A 

IPIDLLNLRE 
IPIDLLNLRE 
IPAKLVLLRD 

G 

SVPWSKILAY 



IV 

FV 

LI 

GD 

IKPAVNVVKA VRNKISVCFN 
IKPVFNVViCA VRNKISACFN 
EKQEFVAPKV VKAKVIACYS 

G 

VKPFLG — QA AITTSNCAKR 



I I 

2395 
LFLTWLLSMF 
IFFTWLLSMF 
TVFLYILSIL 
KILRLLLEVF 
FIKWLFVLLF 
FIKWLFVLLF 
AVKWFFLYCF 
WRNITDSIK 
LAQRVFNNYM 



I I I I I I I I I I I I 

2405 2415 2425 2435 2445 2455 

EMCR SLLRTSIMKH DIKVIAKAPK RTGVILTRSF KYNIRSALFV VKQKWC-VIV TLFKFLLLLY 

22 9E TLCKTAVTTG DVKIMAKAPQ RTGVVLKRSL KYNLKASAAV LKSKWW-LLA KFTKLLLLIY 

PEDV GLCFRAFRKR DVKVLAGVPQ RTGIILRKSM RYNAKALGVF FKLKLY-WFK VLGKFSLGIY 

TGEV KYLLVLFMCL RSTKMPKVKV KP-PLAFKDF GAKVRTLNYM RQLNKP-SVW RYAKLVLLLI 

OC43 GWIKISADNK VIYTTEIASK LTCKLVALAF KNAFLTFKWS MVARGA-CII ATIFLLWFNF 

BoCoV GWIKISADNK VIYTTEVASK LTCKLVALAF KNAFLTFKWS WARGA-CII ATIFLLWFNF 

MHV SWIKFNTDNK VIYTTEVASK LTFNLCCLAF KNALQTFNWN WSRGF-FLV ATVFLLWFNF 

AIPV GLCGITRGHF ERKMSPQFLK TLMFFLFYFL KASVKSWAS YKTVLCKWL ATLLIVWFVY 

SARS CoV PYVFTLLFQL CTFTKSTNSR IRASLPTTIA KNSVKSVAKL CLDAGI-NYV KSPKFSKLFT 



I I I I I I I I I I I I 

2465 2475 2485 2495 2505 2515 

EMCR AIYALVFMIV QFSPFNSL-L CGDIVSGYEK STF NK DIYCGNSMVC 

229E TLYSWLLCV RFGPFN F CSETVNGYAK SNF VK DDYCDGSLGC 

PEDV ALYALLFMTI RFTPIGSP-V CDDVVAGYAN SSF DK NEYCN-SVIC 

TGEV AIYNFFYLFV SIPVVHKL-T CNGAVQAYKN SSF IK SAVCGNSILC 

CX:43 lYANVIFSDF YLPKIGFLPT FVGKIAQWIK NTFSLVTICD LYSMQDVGFK NQYCNGSIAC 

BoCoV lYANVIFSDF YLPKIGFLPT FVGKIVQWIK NTFSLVTICD LYSIQDVGFK NQYCNGSIAC 

MHV LYANVILSDF YLPNIGFFPT FVGQIVAWVK TTFGIFTLCD LYQVSDVGYR SSFCNGSMVC 

AIPV TSNPVMFTGI RVLDFLFEGS LCGPYKDYGK DSFD VL R-YCADDFIC 

SARS CoV lAMWLLLLSI CLGSLICVTA AFGVLLSNFG APSYCNGVRE LYLNSSNVTT MDFCEGSFPC 



I I I I I I I I I.. ..I I I 

2525 2535 2545 2555 2565 2575 

EMCR KMCLFSYQEF NDLDHTSLVW KHIR DP ILISLQPFVI LVILLIFG 

229E KMCLFGYQEL SQFSHLDWW KHIT DP LFSNMQPFIV MVLLLIFG 

PEDV KVCLYGYQEL SDFSHTQVVW QHLR DP LIGNVMPFFY LAFLAIFG — 

TGEV KACLASYDEL ADFQHLQVTW DFKS DP LWNRLVQLSY FAFLAVFG 

OC43 QFCLAGFDML DNYKAIDVVQ YEAD RR AFVDYTGVLK IVIELIVSYA LYTAWFYPLF 

BoCoV QFCLAGFDML DNYKAIDVVQ YEAD RR AFVDYTGVLK IVIELIVSYA LYTAWFYPLF 

MHV ELCFSGFDML DNYDAINWQ HVVD RR VSFDYISLFK LVVELVIGYS LYTVCFYPLF 

AIPV RVCLHDKDSL HLYKHAYSVE QVYKDAASGF IFNWNWLYLV FLILFVKP — 

SARS COV SICLSGLDSL DSYPALETIQ VTISSYKLDL TILGLAAEWV LAYMLFTKFF YLLG L 



I ..I ....I.. ..I I. ...I I.... I 

2585 2595 2605 2615 2625 2635 

EMCR NMYLRFGLLY FVAQFISTFG SFLGFHQKQW FLHFVPFDVL CNEFLATFIV CKIVLFVRHI 
229E DNYLRCFLLY FVAQMISTVG VFLGYKETNW FLHFIPFDVI CDELLVTVIV IKVISFVRHV 
PEDV GVYVKAITLY FIFQYLNSLG VFLGLQQSIW FLQLVPFDVF GDEIVVFFIV TRVLMFIKHV 
TGEV NNYVRCFLMY FVSQYLNLWL SYFGYVEYSW FLHVVNFESI SAEFVIVVIV VKAVLALKHI 
OC4 3 ALISIQILTT WLPELFMLST LHWSFRLLVA LANMLPAHVF MRFYIIIASF IKLFSLFRHV 
BoCoV ALISIQILTT WLPELLMLST LHWSVRLLVS LANMLPAHVF MRFYIIIASF IKLFSLFRHV 
MHV GLIGMQLLTT WLPEFFMLET MHWSARFFVF VANMLPAFTL LRFYIWTAM YKIFCLCRHV 
AIPV VAGFVIICYC VKYLVLNSTV LQTGVCFLDW FVQTVFSHFN FMGAGFYFWL FYKIYIQVHH 
SARS CoV SAIMQVFFGY FASHFISNS WLMWFIIS IVQMAPVSAM VRMYIFFASF YYIWKSYVHI 



I I I I I I I I 1 I I I 

2645 2655 2665 2675 2685 2695 

EMCR IVGCNNADCV ACSKSARLKR VPLQTIINGM HKSFYVNANG GTCFCNKHNF FCVNCDSFGP 

229E LFGCENPDCI ACSKSARLKR FPVNTIVNGV QRSFYVNANG GSKFCKKHRF FCVDCDSYGY 

PEDV CLGCDKASCV ACSKSARLKR VPVQTIFQGT SKSFYVHANG GSKFCKKHNF FCLNCDSYGP 

TGEV VFACSNPSCK TCSRTARQTR IPIQVVVNGS MKTVYVHANG TGKFCKKHNF YCKNCDSYGF 

OC43 AYGCSKSGCL FCYKRNRSLR VKCSTIVGGM IRYYDVMANG GTGFCSKHQW NCIDCDSYKP 

BoCoV AYGCSKSGCL FCYKRNRSLR VKCSTIVGGM IRYYDVMANG GTGFCSKHQW NCIDCDSYKP 

MHV MYGCSRPGCL FCYKRNRSVR VKCSTWGGT LRYYDVMANG GTGFCAKHQW NCLNCSAFGP 

AIPV ILYCKDVTCE VCKRVARSNR QEVSVVVGGR KQIVHVYTNS GYNFCKRHNW YCRNCDDYGH 

SARS CoV MDGCTSSTCM MCYKRNRATR VECTTIVNGM KRSFYVYANG GRGFCKTHNW NCLNCDTFCT 
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2705 2715 2725 2735 2745 2755 

EMCR GNTFINGDIA RELGNVVKTA VQPTAPAYVI IDKVDFVNGF YRIiYSGDTFW RYDFDITESK 

229E GSTFITPEVS RELGNITKTN VQPTGPAYVM IDKVEFENGF YRLYSCETFW RYNFDITESK 

PEDV GCTFINDVIA TEVGNWKLN VQPTGPATIL IDKVEFSNGF YYLYSGDTFW KYNFDITDSK 

TGEV ENTFICDBIV RDLSNSVKQT VYATDRSHQE VTKVECSDGF YRFYVGDEFT SYDYDVKHKK 

OC43 GNTFITVEAA LDLSKELKRP ZQPTDVAYHT VTDVKQVGCS MRLFYDRDGQ RTYDDVNASL 

BoCoV GNTFITVEAA LDLSKELKRP IQPTDVAYHT VTDVKQVGCY MRLFYDRDGQ RTYDDVNASL 

MHV GNTFITHEAA ADLSKELKRP VNPTDSAYYL VTEVKQVGCS MRLFYERDGQ RVYDDVSASL 

AIPV QNTFMSPEVA GBLSEKLKRH VKPTAYAYHV VDEACLVDDF VNLKYKAATP GKDSASSAVK 

SARS CoV GSTFISDEVA RDLSLQFKRP INPTDQSSYI VDSVAVKNGA LHLYFDKAGQ KTYERHPLSH 



I I I I I.. ..I I I I I I I 

2765 2775 2785 2795 2805 2815 

EMCR YSCKEVLKN CNVLENFIVY NNSGS— NIT QIKNACVYFS QLLCEPIKLV 

22 9E YSCKEVFKN- CNVLDDFIVF NNNGT — NVT QVKNASVYFS QLLCRPIKLV 

PEDV YTCKEALKN CSIITDFIVF NNNGS— NVN QVKNACVYFS QMLCKPVKLV 

TGEV YSSQEVLKS- MLLLDDFIVY SPSGS — ALA NVRNACVYFS QLIGKPIKIV 

OC43 FVDYSNLLHS KV KSVPNMHVW VENDA — DKA NFLNAAVFYA QSLFRPILMV 

BoCoV FVDYSNLLHS KV KSVPNMHVW VENDA — DKA NFLNAAVFYA QSLFRPILMV 

MHV FVDMNGLLHS KV KGVPETHWV VENEA— DKA GFLNAAVFYA QSLYRPMLLV 

AIPV CFSVTDFLKK AVFLKEALKC EQISNDGFIV CNTQSAHALE EAKNAAIYYA QYLCKPILIL 
SARS CoV FVNLDNLRAN NT KGSLPINVIV FDGKSKCDES ASKSASVYYS QLMCQPILLL 



I I I I I.... I I I I 1 I I 

2825 2835 2845 2855 2865 2875 

EMCR NSELLSTLS- -VDFNGVLHK AYVDVLCNSF FKELTANMSM AECKATLGLT 

229E DSELLSTLS VDFNGVLHK AYIDVLRNSF GKDLNANMSL AECKRALGLS 

PEDV DSALLASLS- -VDFGASLHS AFVSVLSNSF GKDLSSCNDM QDCKSTLGFD D 

TGEV NSDLLEDLS- -VDFKGALFN AKKNVIKNSF NVDVSECKNL DECYRACNLN 

OC43 DKNLITTANT GTSVTETMFD VYVDTFLSMF DVDKKSLNAL lATAHSSIKQ GTQIYKVLDT 

BoCoV DKILITTANT GTSVTETMFD VYVDTFLSMF DVDKKSLNAL lATAHSSIKQ GTQICKVLDT 

MHV EKKLITTANT GL5VSQTMFD LYVDSLLGVL DVDRKSLTSF VNAAHNSLKE GVQLEQVMDT 

AIPV DQALYEQLW -EPVSKSVID KVCSILSSII SVDTAALNYK AGTLRDALLS 

SARS COV DQVLVSDVGD STEVSVKMFD AYVDTFSATF SVPMEKLKAL VATAHSELAK GVALDGVLST 



I I I I I.... I I I I 1 I I 

2885 2895 2905 2915 2925 2935 

EMCR VSDDDF VSAVANAHRY DVLLSDLSFN NFFISYAKPE DK-LSVYDIA 

22 9E ISDHEF TSAISNAHRC DVLLSDLSFN NFVSSYAKPE EK-LSAYDLA 

PEDV VPLDTP NAAVAEAHRY DVLLTDMSFN NFTTSYAKPE EK-FPVHDIA 

TGEV VSFSTF EMAVNNAHRF GILITDRSFN NFWPSKVKPG SSGVSAMDIG 

OC43 FLSCARKSCS IDSDVDTKCL ADSVMSAVSA GLELTDESCN NLVPTYLKSD N— IVAADLG 

BOCoV FLSCARKSCS IDSDVDTKCL ADSVMSAVSA GLELTDESCN NLVPTYLKGD N-- IVAADLG 

MHV FIGCARRKCA IDSDVETKSI TKSIMSAVNA GVDFTDESCN NLVPTYVKSD T— IVAADLG 

AIPV ITKDEEA VDMAIFCHNH DVDYTGDGFT NVIPSYGIDT G-KLTPRDRG 

SARS CoV FVSAARQG~V VDTDVDTKDV lECLKLSHHS DLEVTGDSCN NFMLTYNKVE N — MTPRDLG 



I 1 I I I I I 1 I I 1 I 

2945 2955 2965 2975 2985 2995 

EMCR CCMRAGSKW NHNVLIKESI PIVWGVKDFN TLSQEGKKYL VKTTKAKGLT FLLTFNDNQA 

22 9E CCMRAGAKW NANVLTKDQT PIVWHAKDFN SLSAEGRKYI VKTSKAKGLT FLLTINENQA 

PEDV TCMRVGAKIV NHNVLVKDSI PWWLVRDFI ALSEETRKYI IRTTKVKGIT FMLTFNDCRM 

TGEV KCMTSDAKIV NAKVLTQRGK SWWLSQDFA ALSSTAQKVL VKTFVEEGVN FSLTFNAVGS 

OC4 3 VLIQNSAKHV QGNVAKIAGV SCIWSVDAFN QFSSDFQHKL KKACCKTGLK LKLTYNKQMA 

BoCoV VLIQNSAKHV QGNVAKIAGV SCIWSVDAFN QLSSDFQHKL KKACCKTGLK LELTYNKQMA 

MHV VLIQNNAKHV QANVAKAANV ACIWSVDAFN QLSADLQHRL RKACSKTGLK IKLTYNKQEA 

AIPV FLINADASIA NLRVKN — AP PVVWKFSELI KLSDSCLKYL ISATVKSGVR FFITKSGAKQ 

SARS CoV ACIDCNARHI NAQVAKSHNV SLIWNVKDYM SLSEQLRKQI RSAAKKNNIP FRLTCATTRQ 



I I I I I I I I I I I I 

3005 3015 3025 3035 3045 3055 

EMCR ITQVP A TSIVAKQGAG FKRTYNFLWY VCLFVVALFI GVSFID 

22 9E VTQIP A TSIVAKQGAG D AGHSLTWLWL LCGLVCLIQF YLCFFMPY — 

PEDV HTTIP T VCIANKKGAG LP S FSKVKKFFWF LCLFIVAAFF ALSFLD 

TGEV DDDLPYERFT ESVSPKSGSG FFDVITQLKQ IVILVFVFIF ICGLCSVYSV 

OC43 NVSVL T TPFSLKGGAV FS Y FVYVCFVLSL VCFIGLWCLM PTYTVH 

BoCoV NVSVL T TPFSLKGGAV FS Y FVYVCFVLSL VCFIGLWCLM PTYTVH 

MHV NVPIL T TPFSLKGGAV FS K VLQWLFWNL ICFIVLWALM PTYAVH 

AIPV VIACHT—QK LLVEKKAGGI VSGTFKCFKS YFKWLLIFYI LFTACCSGYY YMEVSKSFVH 

SARS CoV WNVI T TKISLKGGKI VS T CFKLMLKATL LCVLAALVCY IVMPVHTLS- 



I I I I I, ...I I.... I I I I I 

3065 3075 3085 3095 3105 3115 

EMCR -YTTTVTSFH GYDFKYIENG QLKVFEAPLH CVRNVFDNFN QWHEAKFGVV TTNSD-KCPI 

22 9E FMYDIVSSFE GYDFKYIENG QLKNFEAPLK CVRNVFENFE DWHYAKFGFT PLNKQ-SCPI 

PEDV -FSTQVSSDS DYDFKYIESG QLKTFDNPLS CVHNVFINFD QWHDAKFGFT PVNNP-SCPI 

TGEV ATQSYIESAE GYDYMVIKNG IVQPFDDTIS CVHNTYKGFG DWFKAKYGFI PTFGK-SCPI 

OC43 — KSDFQLPV YASYKVLDNG VIRDVSVEDV CFANKFEQFD QWYESTFGLS YYSNSMACPI 

BoCoV —KSDFQLPV YASYKVLDNG VIRDVSVEDV CFANKFEQFD QWYESTFGLS YYSNSMACPI 

MHV — KSDMQLPL YASFKVIDNG VLRDVTVTDA CFANKFIQFD QWYESTFGLV YYRNSRACPV 

AIPV PMYDVNSTLH VEGFKVIDKG VLREIVPEDT CFSNKFVNFD AFWG RP YDNSR-NCPI 

SARS CoV — IHDGYTNE IIGYKAIQDG VTRDIISTDD CFANKHAGFD AWFSQRGGSY KNDKS— CPV 
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PEDV 

TGEV 

OC43 

BoCoV 
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I I 

3125 

WG VSER 

VVG VSEI 

WG VSDE 

VVGTVFDLEN 
VVA-VIDQDF 
VVA-WDQDF 
VVA-VIDQDI 
VTA — VIDGD 
VAA-IITREI 



I I 

3135 
INVVPGVPTN 
VNTVAGIPSN 
ARTVPGIPAG 
MRPIPDVPAY 
GSTVFNVPTK 
GSTVFNVPTK 
GYTLFNVPTK 
GTVATGVPGF 
GFIVPGLPGT 



..I I 

3155 

KTLV 

KTLI 

KTLV 

RSLV 

YHVL 

YHVL 

FHVL 

VSWVMDGVMF IHMTQTERKP 
VLRAIN GDFL 



I I 

3145 

VYLVG 

VYLVG 

VYLAG 

VSIVG 

VLRYG 

VLRYG 

VLRYG 



I I 

3165 
FTLQAAFGNT 
FTLQAAFGNA 
FAINTIFGTS 
FAINAAFGVT 
HFITHALSAD 
HFITHALSAD 
HFITHAFATD 
WYIPTWFNRE 
HFLPRVFSAV 



I I 

3175 
GVCYDFDGVT 
GVCYDIFGVT 
GLCFDASGVA 
NMCYDHTGNA 
GVQCYTPHSQ 
GVQCYTPHSQ 
SVQCYTPHMQ 
IVGYTQDSII 
GNICYTPSKL 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS Gov 



I I 

3185 

TS DK 

TP EK 

DK GA 

VSKD-SYFDT 
ISYSNFYASG 
ISYSNFYASG 
IPYDNFYASG 
TEG-SFYTSI 
lEYSDFATSA 





3195 
CIFNSACTRL 
CIFTSACTRL 
CIFNSACTTL 
CVFNTACTTL 
CVLSSACTMF 
CVLSSACTMF 
CVLSSLCTML 
ALFSARCLYL 
CVLAAECTIF 



I I 

3205 
EGLGGD-NVY 
EGLGGN-NVY 
SGLGGT-AVY 
TGLGGT-IVY 
TMADGSPQPY 
AMADGSPQPY 
AHADGTPHPY 
TASNTP-QLY 
KDAMGKPVPY 



.,..1 I 

3215 
CYN-TDLIEG 
CYN-TALMEG 
CYK-NGLVEG 
CAK-QGLVEG 
CYT-EGLMQN 
CYT-DGLMQN 
CYT-EGIMHN 
CFNGDNDAPG 
CYD-TNLLEG 



1 I 

3225 
SKPYSILQPN 
SLPYSSIQAN 
AKLYSELAPH 
AKLYSDLMPD 
ASLYSSLVPH 
ASLYSSLVPH 
ASLYDSLAPH 
ALPFGSIIPH 
SISYSELRPD 



I I 

3235 
AYYKYDVKN- 
AYYKYDNGN- 
SYYKMVDGN- 
YYYEHASGN- 
VRYNLANAKG 
VRYNLANAKG 
VRYNLANSNG 
RVYFQPNGVR 
TRYVLMDGS- 



EMCR 

22 9E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS Gov 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BOCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



I I 

3245 
YVRFPEILAR 
FIKLPEVIAQ 
AVSLPEIISR 
MVKLPAIIR- 
FIRFPEVLRE 
FIRLPEVLRE 
YIRFPEWSE 
LIVPQQILHT 
IIQFPNTYLE 



I \ 

3255 
GFGLRTIRTL 
GFGFRTVRTI 
GFGIRTIRTK 
GLGLRFVKTQ 
GL-VRIVRTR 
GL-VRIVRTR 
GI-VRIVRTR 

PY WKFV 

GS-VRWTTF 

( I 



3265 
ATRYCRVGEC 
ATKYCRVGEC 
AMTYCRVGQC 
ATTYCRVGEC 
SMSYCRVGLC 
SMSYCRVGLC 
SMTYCRVGLC 
SDSYCRGSVC 
DAEYCRHGTC 

I I 



I I 

3275 
RDSHKGVCFG 
VESNAGVCFG 
VQSAEGVCFG 
IDSKAGFCFG 
EEADEGICFN 
EEADEGICFN 
EDAEEGVCFN 
EYTRPGYCVS 
ERSEVGICLS 

1 . 



3285 
FDKWYVNDGR 
FDKWFVNDGR 
ADRFFVYNAE 
GDNWFVYDNE 
FNGSWVLNND 
FNGSWVLNND 
FNSSWVLNNP 
LNPQWVLFND 
TSGRWVLNNE 



I I 

3295 

VD DGYIC 

VA NGYVC 

SG SDFVC 

FG NGYIC 

YYRSLPGTFC 
YYRSLPGTFC 
YYRAMPGTFC 
EYTSKPGVFC 
HYRALSGVFC 



3305 
GDGLIDLLVN 
GTGLWNLVFN 
GTGLFTLLMN 
GNSVLGFFKN 
GRDVFDLIYQ 
GRDVFDLIYQ 
GRNAFDLIHQ 
GSTVRELMFS 
GVDAMNLIAN 

I 



3365 
TVVCATLINN 
TVVVAVLLNN 
TVGACTLLNN 
MIIVTLWNN 
VNVIVWCVNF 
VNVIVWCVNF 
INVIVWCINF 
ITMLVWVINA 
ANALLFLMSF 



3315 
VLSIFSSSFS 
ILSMFSSSFS 
VISVFSKTVP 
VFKLFNSNMS 
LFKGLAQPVD 
LFKGLAQPVD 
VLGGLVRPID 
MVSTFFTGVN 
IFTPLVQPVG 

I. 



3325 
WAMSGHMLF 
VAAMSGQILL 
VTVLSGQILF 
WATSGAMLV 
FLALTASSIA 
FLALTASSIA 
FFALTASSVA 
-PNIYMQLAT 
ALDVSASVVA 



3335 
NFLFAAFITF 
NCALGAFAIF 
NCIIAFVAVA 
NIIIACLAIA 
GAILAVIVVL 
GAILAVIWL 
GAILAIIWL 
MFLILWWL 
GGIIAILVTC 



1 I 

3345 
LCFLVTKFKR 
CCFLVTKFRR 
VCFLFTKFKR 
MCYGVLKFKK 
VFYYLIKLKR 
GFYYLIKLKR 
AFYYLIKLKR 
IFAMVIKFQG 
AAYYFMKFRR 



3375 
ISYVVTQN-L 
VSYIVTQN-L 
VSYIVTQN-T 
VSYFVTQN-T 
MMLFVFQVYP 
MMLFVFQVYP 
LMLFVFQVYP 
FILCVHSYNS 
TILCLVPAYS 



I I 

3385 
FFMLLYAILY 
VTMIAYAILY 
LGMLGYATLY 
FFMIIYAIVY 
ILSCVYAICY 
TLSCVYAICY 
TLSCLYACFY 
VLAVILLVLY 
FLPGVYSVFY 



I I 

3395 
FVFTRTVR — 
FFATRSLR— 
FLCTKGVR — 
YFITRKLA — 
FYATLYFPSE 
FYATLYFPSE 
FYTTLYFPSE 
CYASLVTSRN 
LYLTFYFTND 



I I 

3405 
YAWIMHIAYI 
YAWIWCAAYL 
YMWIWHLGFL 
YPGILDAGFI 
ISVIMHLQWL 
ISVIMHLQWL 
ISWMHLQWL 
TVIIMHCWLV 
VSFLAHLQWF 



I I 

3355 
VFGDLSYGVF 
MFGDLSVGVC 
MFGDMSVGVF 
IFGDCTFLIV 
AFGDYTSVVF 
AFGDYTSIVF 
AFGDYTSWV 
VFKAYATTVF 
VFGEYNHVVA 

\ 



3415 
VAYFLLIPWW 
lAYISFAPWW 
ISYILIAPWW 
lAYINMAPWY 
VMYGTIMPLW 
VMYGTIMPLW 
VMYGAIMPLW 
FTFGLIVPTW 
AMFSPIVPFW 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229B 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



I I 

3425 
LLTWFSFAAF 
LCAWYFLAML 
VLMVYAFSAI 
VITAYILVFL 
FCLLYIAWV 
FCLLYISWV 
FCIIYVAWV 
LACCYLGFII 
ITAIYVFCIS 

I 



I I 

3435 
LELLPNVFKL 
TGLLPSLLKL 
FEFMPNLFKL 
YDSLPSLFKL 
SN — HAFWVF 
SN — HAFWVF 
SN — HALWLF 
YMYTPLFLWC 
LKHCHWFFNN 

I I 



I 



3485 
T— ISPEKLK 
S— ISPEKLK 
S— ISTEKLR 
S— TSIARIK 
S— LSDVAFN 
S — LSDVAFN 
S— VSDVAFN 
E— i-GDKFE 
ETLLPLTQYN 



3445 

K ISTQL 

K VSTNL 

K VSTQL 

K VSTNL 

S YCRKL 

S YCRQL 

S YCRKL 

YGTTKNTRKL 
Y LRKRV 

I I 



3455 
FEGDKFIGTF 
FEGDKFVGTF 
FEGDKFVGSF 
FEGDKFVGNF 
GTSVRSDGTF 
GTSVRSDGTF 
GTEVRSDGTF 
YDGNEFVGNY 
MFNGVTFSTF 



3495 
NYAASYNKYK 
SYAASYNRYK 
QYASTYNKYK 
SYANSFNKYK 
RYLSLYNKYR 
RYLSLYNKYR 
RYLSLYNKYR 
AYLSAYARLK 
RYLALYNKYK 



3505 
YYSGSASEAD 
YYSGNANEAD 
YYSGSASEAD 
YYTGSMGEAD 
YYSGKMDTAA 
YYSGKMDTAA 
YFSGKMDTAA 
YYSGTGSEQD 
YFSGALDTTS 



I I 

3515 
YRCACYAHLA 
YRCACYAYLA 
YRLACFAHLA 
YRMACYAHLG 
YREAACSQLA 
YREAACSQLA 
YREAACSQLA 
YLQACRAWLA 
YREAACCHLA 



I I 

3465 
ESAAAGTFVL 
ESAAAGTFVI 
ENAAAGTFVL 
ESAAMGTFVI 
EEMALTTFMI 
EEMALTTFMI 
EEMSLTTFMI 
DLAAKSTFVI 
EEAALCTFLL 

I 1 



1 



3475 
DMRSYERLIN 
DMRSYEKLAN 
DMHAYERLAN 
DMRSYETIVN 
TKDSYCKLKN 
TKDSYCKLKN 
TKESYCKLKN 
RGSEFVKLTN 
NKEMYLKLRS 

I I 



3525 
KAMLDYAKDH 
KAMLDFSRDH 
KAMMDYASNH 
KALMDYSVNR 
KAMDTFTNNN 
KAMDTFTNNN 
KAMETFNHNN 
YALDQYR-NS 
KALNDFS-NS 



3535 
N-DMLYSPPT 
N-DILYTPPT 
N-DTLYTPPT 
T-DMLYTPPT 
GSDVLYQPPT 
GSDVLYQPPT 
GNDVLYQPPT 
GVEIVYTPPR 
GADVLYQPPQ 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



49/87 



PCT/NL2004/000805 



I I I I I I I I I I I I 

3545 3555 3565 3575 3585 3595 

EMCR ISYN-STLQS GLKKMAQPSG CVERCVVRVC YGSTVLNGVW LGDTVTCPRH VIAPS-TTVL 

229E VSYG-STLQA GLRKMAQPSG FVEKCWRVC YGNTVLNGLW LGDIVYCPRH VIASN-TTSA 

PEDV VSYN-STLQA GLRKMAQPSG WEKCIVRVC YGMMALNGLW LGDIVMCPRH VIASS-TTST 

TGEV VSVN-STLQS GLRKMAQPSG LVEPCIVRVS YGNNVLNGLW LGDEVICPRH VIASD-TTRV 

OC43 ASVSTSFLQS GIVKMVNPTS KVEPCVVSVT YGNMTLNGLW LDDKVYCPRH VICSASDMTN 

BoCoV ASVSTSFLQS GIVKMVNPTS KVEPCIVSVT YGNMTLNGLW LDDKVYCPRH VICSASDMTN 

MHV ASVTTSFLQS GIVKMVFPTS KVEPCVVSVT YGNMTLNGLW LDDKVYCPRH VICSSADMTD 

AIPV YSIGVSRLQS GFKKLVSPSS AVEKCIVSVS YRGNNLNGLW LGDTIYCPRH VLG KFSG 

SARS CoV TSITSAVLQS GFRKMAFPSG KVEGCMVQVT CGTTTLNGLW LDDTVYCPRH VICTAEDMLN 

I.... I I I I I ....I I I I I I 

3605 3615 3625 3635 3645 3655 

EMCR IDYDHAYSTM RLHNFSVSHN G-VFLGVVGV TMHGSVLRIK VSQSNVHTPK HVFKTLKPGA 

229E IDYDHEYSIM RLHNFSIISG T-AFLGWGA TMHGVTLKIK VSQTNMHTPR HSFRTLKSGE 

PEDV IDYDYALSVL RLHNFSISSG N-VFLGWSA TMRGALLQIK VNQNNVHTPK YTYRTVRPGE 

TGEV INYENEMSSV RLHNFSVSKN N-VFLGVVSA RYKGVNLVLK VNQVNPNTPE HKFKSIKAGE 

OC43 PDYTNLLCRV TSSDFTVLFD R-LSLTVMSY QMRGCMLVLT VTLQNSRTPK YTFGVVKPGE 

BoCoV PDYTNLLCRV TSSDFTVLFD R-LSLTVMSY QMQGCMLVLT VTLQNSRTPK YTFGVVKPGE 

MHV PDYSNLLCRV ISSDFCVMSG R-MSLTVMSY QMQGSLLVLT VTLQNPNTPK YSFGVVKPGE 

AIPV DQWNDVLNLA NNHEFEVTTQ HGVTLNWSR RLKGAVLILQ TAVANAETPK YKFIKANCGD 

SARS CoV PNYEDLLIRK SNHSFLVQAG N-VQLRVIGH SMQNCLLRLK VDTSNPKTPK YKFVRIQPGQ 

I.,.. I I I I I ....I I I I I I 

3665 3675 3685 3695 3705 3715 

EMCR SFNILACYEG lASGVFGVNL RTNFTIKGSF INGACGSPGY NVRNDGTVEF CYLHQIELGS 

229E GFNILACYDG CAQGVFGVNM RTNWTIRGSF INGACGSPGY NLKN-GEVEF VYMHQIELGS 

PEDV SFNILACYDG AAAGVYGVNM RSNYTIRGSF INGACGSPGY NINN-GTVEF CYLHQLELGS 

TGEV SFNILACYEG CPGSVYGVNM RSQGTIKGSF lAGTCGSVGY VLEN-GILYF VYMHHLELGN 

OC43 TFTVLAAYNG KPQGAFHVTM RSSYTIKGSF LCGSCGSVGY VIMG-DCVKF VYMHQLELST 

BoCoV TFTVLAAYNG KPQGAFHVTM RSSYTIKGSF LCGSCGSVGY VIMG-DCVKF VYMHQLELST 

MHV TFTVLAAYNG KSQGAFHVTM RSSYTIKGSF LCGSCGSVGY VLTG-DSVRF VYMHQLELST 

AIPV SFTIACAYGG TWGLYPVTM RSNGTIRASF LAGACGSVGF NIEK-GWNF FYMHHLELPN 

SARS CoV TFSVLACYNG SPSGVYQCAM RPNHTIKGSF LNGSCGSVGF NIDY-DCVSF CYMHHMELPT 

1 I I I I I I I I I I I 

3725 3735 3745 3755 3765 3775 

EMCR GAHVGSDFTG SVYGNFDDQP SLQVESANLM LSDNVVAFLY AALLNGCR WWLRST 

229E GSHVGSSFDG VMYGGFEDQP NLQVESANQM LTVNVVAFLY AAILNGCT WWLKGE 

PEDV GCHVGSDLDG VMYGGYEDQP TLQVEGASSL FTENVLAFLY AALINGST WWLSSS 

TGEV GSHVGSNFEG EMYGGYEDQP SMQLEGTNVM SSDNWAFLY AALINGER WFVTNT 

OC43 GCHTGTDFNG DFYGPYKDAQ WQLLIQDYI QSVNFVAWLY AAILNNCN WFVQSD 

BoCoV GCHTGTDFNG DFYGPYKDAQ VVQLPVQDYI QSVNFVAWLY AAILNNCN — WFVQSD 

MHV GCHTGTDFSG NFYGPYRDAQ WQLPVQDYT QTVNVVAWLY AAILNRCN — WFVQSD 

AIPV ALHTGTDLMG EFYGGYVDEE VAQRVPPDNL VTNNIVAWLY AAIISVKESS FSLPKWLEST 
SARS CoV GVHAGTDLEG KFYGPFVDRQ TAQAAGTDTT ITLNVLAWLY AAVINGDR — WFLNRF 

I I I I I I I I 1 I I I 

3785 3795 3805 3815 3825 3835 

EMCR RVNVDGFNEW AMANGYTIVS SV— ECYSIL AAKTGVSVEQ LLASIQHLHE -GFGGKNILG 

229E KLFVEHYNEW AQANGFTAMN GE — DAFSIL AAKTGVCVER LLHAIQVLNN -GFGGKQILG 

PEDV RIAVDRFNEW AVHNGMTTVG NT — DCFSIL AAKTGVDVQR LLASIQSLHK -NFGGKQILG 

TGEV SMSLESYNTW AKTNSFTELS ST — DAFSML AAKTGQSVEK LLDSIVRLNK -GFGGRTILS 

OC43 KCSVEDFNVW ALSNGFSQVK SD — LVIDAL ASMTGVSLET LLAAIKRLKN -GFQGRQIMG 

BoCoV KCSVEDFNVW ALSNGFSQVK SD — LVIDAL ASMTGVSLET LLAAIKRLKN -GFQGRQIMG 

MHV SCSLEEFNVW AMTNGFSSIK AD — LVLDAL ASMTGVTVEQ ILAAIKRLYS -GFQGKQILG 

AIPV TVSVDDYNKW AGDNGFTPFS TS — TAITKL SAITGVDVCK LLRTIMVKNS -QWGGDPILG 

SARS CoV TTTLNDFNLV AMKYNYEPLT QDHVDILGPL SAQTGIAVLD MCAALKELLQ NGMNGRTILG 

I I 1 I I I I I I I I I 

3845 3855 3865 3875 3885 3895 

EMCR YSSLCDEFTL AEVVKQMYGV NLQSGK V IFGLKTMFLF SVFFTMFWAE LFIYTNTIWI 

22 9E YSSLNDEFSI NEVVKQMFGV NLQSGK T TSMFKSISLF AGFFVMFWAE LFVYTTTIWV 

PEDV HTSLTDEFTT GEWRQMYGV NLQGGY V SRACRNVLLV GSFLTFFWSE LVSYTKFFWV 

TGEV YGSLCDEFTP TEVIRQMYGV NLQAGK V KSFFYPIMTA MTILFAFWLE FFMYTPFTWI 

OC43 SCSFEDELTP SDVYQQLAGI KLQSKRTRLF KGTVCWIMAS TFLFSCIITA FVKWTMFMYV 

BoCoV SCSFEDELTP SDVYQQLAGI KLQSKRTRLV KGIVCWIMAS TFLFSCIITA FVKWTMFMYV 

MHV SCVLEDELTP SDVYQQLAGV KLQSKRTRW KGTCCWILAS TLLFCSIISA FVKWTMFMYV 

AIPV QYNFBDELTP ESVFNQIGGV RLQSSFVR— K— ATSWFWS RCVLACFLFV LCAIVLFTAV 

SARS CoV STILEDEFTP FDVVRQCSGV TFQGKFKKIV KGTHHWMLLT FLTSLLILVQ STQWSLFFFV 

I I I I 1 I I I I I I I 

3905 3915 3925 3935 3945 3955 
EMCR NPVILTPIFC LLLFLSLVLT MFLKHKFLFL QVFLLPTVIA TALYNC-VLD YYIVKFLADH 
22 9E NPGFLTPFMI LLVALSLCLT FVVKHKVLFL QVFLLPSIIV AAIQNC-AWD YHVTKVLAEK 
PEDV NPGYVTPMFA CLSLLSSLLM FTLKHKTLFF QVFLIPALIV TSCINL-AFD VEVYNYLABH 
TGEV NPTFVSIVLA VTTLISTVFV SGIKHKMLFF MSFVLPSVIL VTAHNL-FWD FSYYESLQSI 
OC43 TTNMFSITFC ALCVIS-LAM LLVKHKHLYL TMYITP-VLF TLLYNN-YLV VYKHTFRGYV 
BoCoV TTNMLSITFC ALCVIS-LJ^ LLVKHKHLYL TMYIIP-VLF TLLYNN-YLV VYKQTFRGYV 
MHV TTHMLGVTLC ALCFVS-FAM LLVKHKHLYL TMFIMP-VLC TLFYTN-YLV VYKQSFRGLA 
AIPV PLKFYVYAAV ILLMAVLFIS FTVKHVMAYM DTFLLPTLIT VIIGVCAEVP FIYNTLISQV 
SARS CoV YENAFLPFTL GIMAIAACAM LLVKHKHAFL CLFLLPSLAT VAYFN MV YMPASWVMRI 

SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 
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PCT/NL2004/000805 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BOCOV 

MHV 

AIPV 

SARS CoV 



3965 
FN-YNVSVLQ 
FD-YNVSVMQ 
FD-YHVSLMG 
VENTNTMFLP 

YAWLS yy VPS 

YAWLSYYVPS 
YAWLSHFVPA 
VIFLSQWYDP 
MTWLELADTS 



I I 

3975 
MDVQGLVNVL 
MDIQGFVNIF 
FNAQGLVNIF 
VDMQGVMLTV 
VEYTYTDEVI 
VEYTYTDEVI 
VDYTYMDEVL 
WFDTMVPWM 
LSGYRLKDCV 



I. ...I 

3985 
VCLFWFLH- 
ICLFVALLH- 
VCFVVTILHG 
FCFIVFVTYS 
YGMLLLVGMV 
YGMLLLIGMV 
YGWLLV/y^V 
FLPLVLYTAF 
MYASALVLLI 



I 



3995 
— TWRFSKER 
— TWRFAKER 
TYTWRFFN-T 
VRFFTCKQSW 
FVTLRSINHD 
FVTLRSINHD 
FVTMRSINHD 
KCVQGCYMNS 
LMTARTVYDD 



4005 
FTHWFTYVCS 
CTHWCTYLFS 
PASSVTYWA 
FSLAVTTILV 
LFSFIMFVGR 
LFSFIMFVGR 
VFSVMFLVGR 
FNTSLLMLYQ 
AARRVWTLMN 



I I 

4015 
LIAVAYTYFY 
LIAVLYTALY 
LLTAAYNYFY 
IFNMVKIFGT 
LISVFSLWYK 
VISWSLWYM 
LVSLVSMWYF 
FVKLGFVIYT 
VITLVYKVYY 



I 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



4025 

SGD 

SYD 

ASD 

SDEPWTENQI 

GSN 

GSN 

GAN 



4035 

FLSL 

YVSL 

ILSC 

— AFCFVNM 

LEEEI 

LEEEI 

LEEEV 



SSNTLTAYTE GNWELFFELV 
GNALD QAISM 



I I 

4045 
LVMFLCAISS 
LVMLLCAISN 
AMTLFASVTG 
LTMIVSLTTK 
LLMLASLFGT 
LLMLASLFGT 
LLFLTSLFGT 
HTTVLANVSS 
WALVISVTSN 



I I 

4055 
DWYIGAIVFR 
EWYIGAIIFR 
NWFVGAVCYK 
DWMWIASYR 
YTWTTVLSMA 
YTWTTALSMA 
YTWTTMLSLA 
NSLIGLFVFK 
YSGWTTIMF 



I I 

4065 
LSRLIIFFSP 
ICRFGVAFLP 
VAVYMALRFP 
lAYYIVVCVM 
VAKVIAKWVA 
AAKVIAKWVA 
TAKVIAKWLA 
CAKWMLYYCN 
lARAIVFVCV 



I I 

4075 

E SVFSVF 

V EYVSYF 

TFVAIF 

P-S-AFVSDF 
VNV-LYFTDI 
VNV-LYFTDI 
VNV-LYFTDV 

ATYL 

EYYPLLFITG 



I I 

4085 
GDVKLTLVVY 
DGVKTVLLFY 
GDIKSVMFCY 
GFMKCISIVY 
PQIKIVLLCY 
PQIKIVLVCY 
PQVKLVLLSY 
NNYVLMAVMV 
NTLQCIMLVY 

1 I 



I 



4095 
LICGYLVCTY 
MLLGFVSCMY 
LVLGYFTCCF 
MACGYLFCCY 
LFIGYIISCY 
LFIGYIISCY 
LCIGYVCCCY 
NCIGWLCTCY 
CFLGYCCCCY 

I I 



I I 

4105 
WGILYWFNRF 
YGLLYWINRF 
YGILYWFNRF 
YGILYWVNRF 
WGLFSLMNSL 
WGLFSLMNSL 
W6VLSLLNSI 
FGLYWWVNKV 
FGLFCLLNRY 



I I 

4115 
FKCTMGVYDF 
CKCTLGVYDF 
FKVSVGVYDY 
TCMTCGVYQF 
FRMPLGVYNY 
FRMPLGVYNY 
FRMPLGVYNY 
FGLTLGKYNF 
FRLTLGVYDY 



I I 

4125 
KVSAAEFKYM 
CVSPAEFKYM 
TVSMEFKYM 
TVSAAELKYM 
KISVQELRYM 
KISVQELRYM 
KISVQELRYM 
KVSVDQYRYM 
LVSTQEFRYM 



4135 
VANGLHAPYG 
VANGLNAPNG 
VANGLRAPTG 
TANNLSAPKN 
NANGLRPPKN 
NANGLRPPKN 
NANGLRPPRM 
CLHKINPPKT 
NSQGLLPPKS 



4145 
PFDALWLSFK 
PFDALFLSFK 
TLDSLLLSAK 
AYDAMILSAK 
SFEALMLNFK 
SFEALMLNFK 
SFEALVLNFK 
VWEVFSTNIL 
SIOAFKLNIK 



4155 
LLGIGGDRCI 
LMGIGGPRTI 
LIGIGGERNI 
LIGVGGKRNI 
LLGIGGVPII 
LLGIGGVPII 
LLGIGGVPVI 
IQGIGGDRVL 
LLGIGGKPCI 





4165 
KISTVQSKLT 
KVSTVQSKLT 
KISSVQSKLT 
KISTVQSKLT 
EVSQFQSKLT 
EVSQFQSKLT 
EVSQIQSRLT 
PIATVQAKLS 
KVATVQSKMS 



I I 

4175 
DLKCTNVVLL 
DLKCTNVVLM 
DIKCSNVVLL 
EHKCTNWLL 
DVKCANWLL 
DVKCANGGLL 
DVKCVNVVLL 
DVKCTTWLM 
DVKCTSVVLL 



I I 

4185 
GCLSSMNIAA 
GILSNMNIAS 
GCLSSMNVSA 
GLLSKMHVES 
NCLQHLHVAS 
NCLQHLHVAS 
NCLQHLHIAS 
QLLTKLNVEA 
SVLQQLRVES 



I I 

4195 
NSSEWAYCVD 
NSKEWAYCVE 
NSTEWAYCVD 
NSKEWNYCVG 
NSKLWHYCST 
NSKLWQYCST 
SSKLWQYCST 
NSKMHVYLVE 
SSKLWAQCVQ 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS COV 



EMCR 

229E 

PEDV 

TGEV 

OC43 

BoCoV 

MHV 

AIPV 

SARS CoV 



I I I 1 i I I I I I I I 

4205 4215 4225 4235 4245 4255 

LHNKINLCDD PEKAQGMLLA LLAFFLSKHS DFG L DGLIDSYFDN SSTLQSVASS 

MHNKINLCDD PETAQELLLA LLAFFLSKHS DFG L GDLVDSYFEN DSILQSVASS 

LHNKINLCND PEKAQEMLLA LLAFFLSKNS AFG L DDLLESYFND NSMLQSVAST 

LHNEINLCDD PEIVLEKLLA LIAFFLSKHN TCD L SELIESYFEN TTILQSVASA 

LHNEILATSD LSVAFEKLAQ LLIVLFANPA AVDSKCLTSI EEVCDDYAKD NTVLQALQSE 
LHNEILATSD LGVAFEKLAQ LLIVLFANPA AVDSKCLTSI EEVCDDYAKD NTVLQALQSE 
LHNEILATSD LSVAFDKLAQ LLVVLFANPA AVDSKCLASI EEVSDDYVRD STVLQALQSE 

LHNKILASDD VGECMDNLLG MLITLFCIDS TID L SEYCDDILKR STVLQSVTQE 

LHNDILLARD TTEAFEKMVS LLSVLLSMQG AVD 1 NRLCEEMLDN RATLQAIASE 

....I. ...I ....I.... I ..I I. ...I I. ...I 

4265 4275 4285 4295 4305 4315 

FVSMPSYIAY ENARQAYEDA lANGSS SQLIKQLKRA MNIAKSEFDH EISVQKKINR 

FVGMPSFVAY ETARQEYENA VANGSS PQIIKQLKKA MNVAKAEFDR ESSVQKKINR 

YVGLPSYVIY ENARQQYEDA VNNGSP PQLVKQLRHA MNVAKSEFDR EASTQRKLDR 

YAALPSWIAL EKARADLEEA KKNDVS PQILKQLTKA FNIAKSDFER EASVQKKLDK 

FVNMASFVEY EVAKKNLDEA RFSGSAN QQQLKQLEKA CNIAKSAYER DRAVAKKLER 

FVNMASFVEY EVAKKNLDEA CSSGSAN QQQLKQLEKA CNIAKSAYER DRAVARKLER 

FVNMASFVEY ELAKKNLDEA KASGSAN QQQIKQLEKA CNIAKSAYER DRAVARKLER 

FSHIPSYAEY ERAKNLYEKV LVDSKNGGVT QQELAAYRKA ANIAKSVFDR DLAVQKKLDS 
FSSLPSYAAY ATAQEAYEQA VANGDS EWLKKLKKS LNVAKSEFDR DAAMQRKLEK 



I 



4325 
MAEQAATQMY 
MAEQAAAAMY 
MAEQAAAQMY 
MAEQAAASMY 
MADLALTNMY 
MADLALTNMY 
MADLALTNMY 
MAERAMTTMY 
MADQAMTQMY 



I I 

4335 
KEARSVNRKS 
KEARAVNRKS 
KEARAVNRKS 
KEARAVDRKS 
KEARINDKKS 
KEARINDKKS 
KEARINDKKS 
KEARVTDRRA 
KQARSEDKRA 



I I 

4345 
KVISAMHSLL 
KVVSAMHSLL 
KVVSAMHSLL 
KIVSAMHSLL 
KWSALQTML 
KWSALQTML 
KWSALQTML 
KLVSSLHALL 
KVTSAMQTML 



I 1 

4355 
FGMLRRLDMS 
FGMLRRLDMS 
FGMLRRLDMS 
FGMLKKLDMS 
FSMVRKLDNQ 
FSMVRKLDNQ 
FSMIRKLDNQ 
FSMLKKIDSE 
FTMLRKLDND 



I I 

4365 
SVETVLNLAR 
SVDTILNMAR 
SVDTILNLAK 
SVNTIIDQAR 
ALNSILDNAV 
ALNSILDNAV 
ALNSILDNAV 
KLNVLFDQAS 
ALNNIINNAR 



I I 

4375 
DGVVPLSVIP 
NGVVPLSVIP 
DGVVPLSVIP 
NGVLPLSIIP 
KGCVPLNAIP 
KGCVPLNAIP 
KGCVPLNAIP 
SGVVPLATVP 
DGCVPLNIIP 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 
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I I I I I I 1 I I I ! I 

4385 4395 4405 4415 4425 4435 

EMCR ATSASKLTIV S POLES YSKI VCDGSVHYAG WWTLNDVKD NDGRPVHVKE ITR EN 

22 9E ATSAARLVVV VPDHDSFVKM MVDGFVHYAG VVWTLQEVKD NDGKNVHLKD VTK EN 

PEDV AVSATKLNIV TSDIDSYNRI QREGCVHYAG TIWNIIDIKD NDGKVVHVKE VTA QN 

TGEV AASATEaVVI TPSLEVFSKI RQENNVHYAG AIWTIVEVKD ANGSHVHLKE VTA AN 

OC43 SLAANTLNII VPDKSVYDQV VDNVYVTYAG NVWQIQTIQD SOGTNKQLNE IS 

BoCoV SLAANTLTII VPDKSVYDQV VDNVYVTYAG NVWQIQTIQD SDGTNKQLHE IS 

MHV SLTSNTLTII VPDKQVFDQV VDNVYVTYAG NVWHIQSIQD ADGAVKQLNE ID 

AIPV IVCSNKLTLV IPDPETWVKC VEGVHVTYST WWNIDTVID ADGTELHPTS TGSGLTYCIS 
SARS Gov LTTAAKLMVV VPDYGTYKNT CDGNTETYAS ALWEIQQVVD ADSKIVQLSE INM DN 



....I I I I I I I I 1 I 

4445 4455 4465 4475 4485 4495 

EMCR VETLTWPLIL NCER WKLQNNEIM PGKLKQKPMK AEG — DGGVL GDGNALYNTE 

22 9E QEILVWPLIL TCER WKLQNNEIM PGKMKVKATK GEG — DGGIT SEGNALYNNE 

PEDV AESLSWPLVL GCER IVKLQNNEII PGKLKQRSIK AEG — DG-IV GEGKALYNNE 

TGEV ELNLTWPLSI TCER TTKLQNNEIM PGKLKERAVR ASATLDGEAF GSGKALMASE 

CX:43 -DDCNWPLVI lANRY-NEVS ATVLQNNELM PAKLKIQVVN SGP — DQTCN TPTQCYYNNS 

BoCoV -DDCNWPLVI lANRH-NEVS ATVLQNNELM PAKLKTQVVN SGP — DQTCN TPTQCYYNNS 

MHV -VNITWPLVI AANRH-NEVS SVVLQNNELM PQKLRTQVVN SGS — DMNCN TPTQCYYNTT 

AIPV GANIAWPLKV NLTRNGHNKV DVVLQNNELM PHGVKTKACV AGVD-QAHCS VESKCYYTNI 

SARS Gov SPNLAWPLIV TALRA-N — S AVKLQNNELS PVALRQMSCA AGTTQTACTD DNALAYYNNS 



I I I I I I I I I I I I 

4505 4515 4525 4535 4545 4555 

EMCR GGKTFMYAYI SNKADLKFVK WEY-EGG-CN TIELDSPCRF MVETPNGPQV KYLYFVKNLN 

229E GGRAFMYAYV TTKPGMKYVK WEH-DSG-W TVELEPPCRF VIDTPTGPQI KYLYFVKNLN 

PEDV GGRTFMYAFI SDKPDLRWK WEF-DGG-CN TIELEPPRKF LVDSPNGAQI KYLYFVRNLN 

TGEV SGKSFMYAFI ASDNNLKYVK WES-NND-II PIELEAPLRF YVDGANGPEV KYLYFVKNLN 

OC43 NNGKIVYAIL SDVDGLKYTK ILKDDGN-FV VLELDPPCKF TVQDAKGLKI KYLYFVKGCN 

BoCoV YNGKIVYAIL SDVDGLKYTK ILKDDGN-FV VLELDPPCKF TVQDVKGLKI KYLYFVKGCN 

MHV GMGKIVYAIL SDCDGLKYTK IVKEDGN-CV VLELDPPCKF SVQDVKGLKI KYLYFVKGCN 

AIPV SGNSVVAAIT SSNPNLKVAS FLNEAGN-QI YVDLDPPCKF GMKVGVKVEV VYLYFIJCNTR 

SARS CoV KGGRFVLALL SDHQDLKWAR FPKSDGTGTI YTELBPPCRF VTDTPKGPKV KYLYFIKGLN 



I I I I 1 I I I I I I I 

4565 4575 4585 4595 4605 4615 

EMCR TLRRGAVLGF IGATIRLQAG -KQTELAVNS GLLTACAFSV DPATTYLEAV KHGAKPVSNC 

229E NLRRGAVLGY IGATVRLQAG -KQTEFVSNS HLLTHCSFAV DPAAAYLDAV KQGAKPVGNC 

PEDV TLRRGAVLGY IGATVRLQAG -KQTEQAINS SLLTLCAFAV DPAKTYIDAV KSGHKPVGNC 

TGEV TLRRGAVLGY IGATVRLQAG -KPTEHPSNS SLLTLCAFSP DPAKAYVDAV KRGMQPVNNC 

OC43 TLARGWVVGT ISSTVRLQAG -TATEYASNS SILSLCAFSV DPKKTYLDFI QQGGTPIANC 

BoCoV TLARGWVVGT ISSTVRLQAG -TATEYASNS SILSLCAFSV DPKKTYLDFI QQGGTPIANC 

MHV TLARGWVVGT LSSTVRLQAG -TATEYASNS AIRSLCAFSV DPKKTYLDYI QQGGAPVTNC 

AIPV SIVRGMVLGA ISNVVVLQSK GHETEEVDAV GILSLCSFAV DPADTYCKYV AAGNQPLGNC 

SARS CoV NLNRGMVLGS LAATVRLQAG -NATEVPANS TVLSFCAFAV DPAKAYKDYL ASGGQPITNC 



....I 1 I I I i I I I I I I 

4625 4635 4645 4655 4665 4675 

EMCR IKMLSNGAGN GQAITTSVDA NTNQDSYGGA SICLYCRAHV PHP SMD GYCKFKGKCV 

229E VKMLTNGSGS GQAITCTIDS NTTQDTYGGA SVCIYCRAHV AHP TMD GFCQYKGKWV 

PEDV VKMLANGSGN GQAVTNGVEA STNQDSYGGA SVCLYCRAHV EHP SMD GFCRLKGKYV 

TGEV VKMLSNGAGN GMAVTNGVEA NTQQDSYGGA SVCIYCRCHV EHP AID GLCRYKGKFV 

OC43 VKMLCDHAGT GMAITVKPDA TTSQDSYGGA SVCIYCRARV EHP DVD GLCKLRGKFV 

BoCoV VKMLCDHAGT GMAITVKPDA TTSQDSYGGA SVCIYCRARV EHP DVD GLCKLRGKFV 

MHV VKMLCDHAGT GMAITIKPEA TTNQDSYGGA SVCIYCRSRV EHP DVD GLCKLRGKFV 

AIPV VKMLTVHNGS GFAITSKPSP TPDQDSYGGA SVCLYCRAHI AHPGSVGNLD GRCQFKGSFV 
SARS COV VKMLCTHTGT GQAITVTPEA NMDQESFGGA SCCLYCRCHI DHP NPK GFCDLKGKYV 



I I I I I I I I \ I I I 

4685 4695 4705 4715 4725 4735 

EMCR QVPIGCL-DP IRFCLENNVC NVCGCWLGHG CACDRTTIQS VDISYLNEQ 

229E QVPIGTN-DP IRFCLENTVC KVCGCWLNHG CTCDRTAIQS -FDNSYLNES 

PEDV QVPLGTV-DP IRFVLENDVC KVCGCWLSNG CTCDRSIMQS T 

TGEV QIPTGTQ-DP IRFCIENEVC VVCGCWLNNG CMCDRTSMQS F TVDQSYLNEC 

OC43 QVPVGIK-DP VSYVLTHDVC RVCGFWRDGS CSCVSTDTTV Q SKDT 

BoCoV QVPVGIK-DP VSYVLTHDVC QVCGFWRDGS CSCVSTDTTV Q SKDTNFLNGF 

MHV QVPLGIK-DP VSYVLTHDVC QVCGFWRDGS CSCVGTGSQF Q SKDTNFLNGF 

AIPV QIPTTEK-DP VGFCLRNKVC TVCQCWIGYG CQCDSLRQPK SSVQSVAGAS DFDKNYLNGY 
SARS CoV QIPTTCANDP VGFTLRNTVC TVCGMWKGYG CSCDQLREPL M QSADASTFLN 



I . . 

4745 

EMCR GVLVQLD 
229E GALVPLD 

PEDV 

TGEV GVLVQLD 

OC43 

BoCoV GVRV 

MHV GVQV 

AIPV GVAVRLG 
SARS Gov GFAV 
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C. Putative orf lb 

— I — I — I — I — I — I — I — I — I — I — I — I 

5 15 25 35 45 55 

EMCR RARGSSAARL EPCN-GTDID KCVRAFDIYN KNVSFLGKCL 

229E EPCN-GTDID YCVRAFDVYN KDASFIGKNL 

PEDV YGLFK RVRGSSAARL EPCN-GTDTQ HVYRAFDIYN KDVACLGKFL 

TGEV EPCN-GTDPD HVSRAFDIYN KDVACIGKFL 

BoCoV FFKR VRGTSVDARL VPCASGLSTD VQLRAFDICN ASVAGIGLHL 

OC43 FFKR VRGTSVDARL VPCASGLSTD VQLRAFDIYN ASVAGIGLHL 

MHV LFLCRHRLPV SVKRHELFKR VRGTSVNARL VPCASGLDTD VQLRAFDICN ANRAGIGLYY 

AIPV MFQNL 

SARS CoV TPCGTGTSTD WYRAFDIYN EKVAGFAKFL 

I I ....I I ....1 I I I I I I I 

65 75 85 95 105 115 

EMCR KMNCVRFKNA DL KDGYFVIKRC TKSVMEHEQS MYNLLNFSGA LAEHDFFTWK 

22 9E KSNCVRFKNV DK DDAFYIVKRC IKSVMDHEQS MYNLLKGCNA VAKHDFFTWH 

PEDV KVNCVRLKNL DK HDAFYVVKRC TKSAMEHEQS lYSRLEKCGA lAEHDFFTWK 

TGEV KTNCSRFRNL DK HDAYYIVKRC TKTVMDHEQV CYNDLKDSGA VAEHDFFTYK 

BoCOV KVNCCRFQRV — DENG — DK LDQFFWKRT DLTIYNREME CYERVKDCKF VAEHDFFTFD 

CX:43 KVNCCRFQRV — DENG — DK LDQFFWKRT DLTIYNREMK CYERVKDCKF VAEHDFFTFD 

MHV KVNCCRFQRA — DEDG — NT LDKFFVIKRT NLEVYNKEKE CYELTKECGV VAEHEFFTFD 

AIPV ' KRNCARFQEL RDTEDGNLEY LDSYFVVKQT TPSNYEHEKS CYEDLKS-EV TADHDFFVFN 

SARS CoV KTNCCRFQEK — DEEG — NL LDSYFVVKRH TMSNYQHEET lYNLVKDCPA VAVHDFFKFR 

I I I I ....I I I I I ! I I 

125 135 145 155 165 175 

EMCR DGRVIYGNVS RHNLTKYTMM DLVYAMRNFD EQNCDVLKEV LVLTGCCDNS YFDSKG 

229E EGRTIYGNVS RQDLTKYTMM DLCFALRNFD EKDCEVFKEI LVLTGCCSTD YFEMKN 

PEDV DGRAIYGNVC RKDLTEYTMM DLCYALRNFD ENNCDVLKSI LIKVGACEES YFNNKV 

TGEV EGRCEFGNVA RRNLTKYTMM DLCYAIRNFD EKNCEVLKEI LVTVGACTEE FFENKD 

BoCoV VEGSRVPHIV RKDLTKYTML DLCYALRHFD RNDCMLLCDI LSIYAGCEQS YFTKKD 

OC43 VEGSRVPHIV RKDLTKYTML DLCYALRHFD RNDCMLLCDI LSIYAGCEQS YFTKKD 

MHV VEGSRVPHIV RKDLSKYTML DLCYALRHFD RNDCSTLKEI LLTYAECDES YFQKKD 

AIPV KN lYNIS RQRLTKYTMM DFCYALRHFD PKDCEVLKEI LVTYGCIEDY HPKWFEENKD 

SARS CoV VDGDMVPHIS RQRLTKYTMA DLVYALRHFD EGNCDTLKEI LVTYNCCDDD YFNKKD 

I I I I ....I I I I I I I I 

185 195 205 215 225 235 

EMCR WYDPVENEDI HRVYASLGKI VARAMLKCVA LCDAMVAKGV VGVLTLDNQD LNGNFYDFGD 

229E WFDPIENEDI HRVYAALGKV VANAMLKCVA FCDEMVLKGV VGVLTLDNQD LNGNFYDFGD 

PEDV WFDPVENEDI HRVYALLGTI VARAMLKCVK FCDAMVEQGI VGVVTLDNQD LNGDFYDFGD 

TGEV HFDPVENEAI HEVYAKLGPI VANAMLKCVA FCDAIVEKGY IGVITLDNQD LNGNFYDFGD 

BOCOV WYDFVENPDI INVYKKLGPI FNRALVSATE FADKLVEVGL VGILTLDNQD LNGKWYDFGD 

OC43 WYDFVENPDI INVYKKLGPI FNRALVSATE FADKLVEVGL VGVLTLDNQD LNGKWYDFGD 

MHV WYDFVENSDI INVYKKLGPI FNRALLNTAK FADTLVEAGL VGVLTLDNQD LYGQWYDFGD 

AIPV WYDPIENSKY YVMLAKMGPI VRRALLNAIE FGNLMVEKGY VGVXTLDNQD LNGKFYDFGD 

SARS CoV WYDFVENPDI LRVYANLGER VRQSLLKTVQ FCDAMRDAGI VGVLTLDNQD LNGNWYDFGD 

I I I I ....I I I I I I 1 I 

245 255 265 275 285 295 

EMCR FVVSLPNMGV PCCTSYYSYM MPIMGLTNCL ASECFVKSDI FGSDFKTFDL LKYDFTEHKE 

22 9E FVLCPPGMGI PYCTSYYSYM MPVMGMTNCL ASECFMKSDI FGQDFKTFDL LKYDFTEHfCE 

PEDV FTCSIKGMGV PICTSYYSYM MPVMGMTNCL ASECFVKSDI FGEDFKSYDL LEYDFTEHKT 

TGEV FVKTAPGFGC ACVTSYYSYM MPLMGMTSCL ESENFVKSDI YGSDYKQYDL LAYDFTEHKE 

BoCoV YVIAAPGCGV AIADSYYSYM MPMLTMCHAL DCELYVNNAY R LFDL VQYDFTDYKL 

OC43 YVIAAPGCGV AIADSYYSYI MPMLTMCHAL DCELYVNNAY R LFDL VQYDFTDYKL 

MHV FVKTVPGCGV AVADSYYSYM MPMLTMCHAL DSELFINGTY R EFDL VQYDFTDFKL 

AIPV FQKTAPGAGV PVFDTYYSYM MPIIAMTDAL APERYFEYDV HKG-YKSYDL LKYDYTEEKQ 

SAEIS CoV FVQVAPGCGV PIVDSYYSLL MPILTLTRAL AAESHMDADL AKP-LIKWDL LKYDFTEERL 

....I I I I I I I I I I I 

305 315 325 335 345 355 

EMCR NLFNKYFKHW SFDYHPNCSO CYDDMCVIHC ANFNTLFATT IPGTAFGPLC RKVFIDGVPL 

229E VLFNKYFKYW GQDYHPDCVD CHDEMCILHC SNFNTLFATT IPNTAFGPLC RKVFIDGVPV 

PEDV ALFNKYFKYW GLQYHPNCVD CSDEQCIVHC ANFNTLFSTT IPITAFGPLC RKCWIDGVPL 

TGEV YLFQKYFKYW DRTYHPNCSD CTSDECIIHC ANFNTLFSMT IPMTAFGPLV RKVHIDGVPV 

BoCoV ELFNKYFKHW SMPYHPNTVD CQDDRCIIHC ANFNILFSMV LPNTCFGPLV RQIFVDGVPF 

OC43 ELFNKYFKHW SMPYHPNTVD CQDDRCIIHC ANFNILFSMV LPNTCFGPLV RQIFVDGVPF 

MHV ELFNKYFKYW SMTYHPNTCE CEDDRCIIHC ANFNILFSMV LPKTCFGPLV RQIFVDGVPF 

AIPV ELFQKYFKYW DQEYHPNCRD CSDDRCLIHC ANFNILFSTL IPQTSFGNLC RKVFVDGVPF 

SARS CoV CLFDRYFKYW DQTYHPNCIN CLDDRCILHC ANFNVLFSTV FPPTSFGPLV RKIFVDGVPF 
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I I 1 I I.. ..I I I I I I I 

36S 375 385 395 405 415 

EMCR VTTAGYHFKQ LGLVWNKDVN THSVRLTITE LLQFVTDPSL IIASSPALVD QRTICFSVAA 

229E VATAGYHFKQ LGLVWNKDVN THSTRLTITE LLQFVTDPTL IVASSPALVD KRTVCFSVAA 

PEDV VTTAGYHFKQ LGIVWNNDLN LHSSRLSINE LLQFCSDPAL LIASSPALVD QRTVCFSVAA 

TGEV VVTAGYHFKQ LGIVWNLDVK LDTMKLSMTD LLRFVTDPTL LVASSPALLD QRTVCFSIAA 

BOCOV WSIGYHYKE LGIVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALYD LRTCCFSVAA 

CX:43 WSIGYHYKE LGIVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALYD LRTCCFSVAA 

MHV WSIGYHYKE LGVVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALLD LRTCCFSVAA 

AIPV lATCGYHSKE LGVIMNQDNT MSFSKMGLSQ LMQFVGDPAL LVGTSNNLVD LRTSCFSVCA 

SARS CoV VVSTGYHFRE LGVVHNQDVN LHSSRLSFKE LLVYAADPAM HAASGNLLLD KRTTCFSVAA 

I I I I I.... I I I I 1 I 1 

425 435 445 455 465 475 

EMCR LSTGLTNQVV KPGHFNEEFY NFLRLRGFFD EGSELTLKHF FFAQNGOAAV KDFDFYRYNK 

22 9E LSTGLTSQTV KPGHFNKEFY DFLRSQGFFD EGSELTLKHF FFTQKGDAAI KDFDYYRYNR 

PEDV LGTGMTNQTV KPGHFNKEFY DFLLEQGFFS EGSELTLKHF FFAQKVDAAV KDFDYYRYNR 

TGEV LSTGITYQTV KPGHFNKDFY DFITERGFFE EGSELTLKHF FFAQGGEAAM TDFNYYRYNR 

BoCoV ITSGVKFQTV KPGNFNQDFY DFILSKGLLK EGSSVDLKHF FFTQDGNAAI TDYNYYKYNL 

OC43 ITSGVKFQTV KPGNFNQDFY DFVLSKGLLK EGSSVDLKHF FFTQDGNAAI TDYNYYKYNL 

MHV ITSGVKFQTV KPGNFNQDFY EFILSKGLLK EGSSVDLKHF FFTQDGNAAI TDYNYYKYNL 

AIPV LTSGITHQTV KPGHFNKDFY DFAEKAGMFK EGSSIPLKHF FYPQTGNAAI NDYDYYRYNR 

SARS CoV LTNNVAFQTV KPGNFNKDFY DFAVSKGFFK EGSSVELKHF FFAQDGNAAI SDYDYYRYNL 

I I I I I I I I I I I I 

485 495 505 515 525 535 

EMCR PTILDICQAR VTYKIVSRYF DIYEGGCIKA CEVWTNLNK SAGWPLNKFG KASLYYESIS 

22 9E PTMLDIGQAR VAYQVAARYF DCYEGGCITS REVWTNLNK SAGWPLNKFG KAGLYYESIS 

PEDV PTVLDICQAR WYQIVQRYF DIYEGGCITA KEVWTNLNK SAGYPLNKFG KAGLYYESLS 

TGEV VTVLDICQAQ FVYKIVGKYF ECYDGGCINA REVWTNYDK SAGYPLNKFG KARLYYETLS 

BoCoV PTMVDIKQLL FVLEWYKYF EIYDGGCIPA AQVIVNNYDK SAGYPFNKFG KARLYYEALS 

OC43 PTMVDIKQLL FVLEWYKYF EIYDGGCIPA SQVIVNNYDK SAGYPFNKFG KARLYYEALS 

MHV PTMVDIKQLL FVLEWNKYF EIYDGGCIPA TQVIVNNYDK SAGYPFNKFG KARLYYEALS 

AIPV PTMFDICQLL FCLEVTSKYF ECYEGGCIPA SQVWNNLDK SAGYPFNKFG KARLYYE-MS 

SARS CoV PTMCDIRQLL FWEWDKYF DCYDGGCINA NQVIVNNLDK SAGFPFNKWG KARLYYDSMS 

I I f I I I I ! I I I I 

545 555 565 575 585 595 

EMCR YEEQDALFAL TKRNVLPTMT QLNLKYAISG KERARTVGGV SLLSTMTTRQ YHQKHLKSIV 

229E YEEQDAIFSL TKRNILPTMT QLNLKYAISG KERARTVGGV SLLATMTTRQ FHQKCLKSIV 

PEDV YEEQDELYAY TKRNILPTMT QLNLKYAISG KERARTVGGV SLLSTMTTRQ YHQKHLKSIV 

TGEV YEEQDALFAL TKRNVLPTMT QMNLKYAISG KARARTVGGV SLLSTMTTRQ YHQKHLKSIA 

BoCoV FEEQDEIYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM FHQKCLKSIA 

OC43 FEEQDEIYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM FHQKCLKSIA 

MHV FEEQDEVYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM FHQKCLKSIA 

AIPV LEEQDQLFEI TKKNVLPTIT QMNLKYAISA KNRARTVAGV SILSTMTNRQ FHQKILKSIV 

SARS Gov YEDQDALFAY TKRNVIPTIT QMNLKYAISA KNRARTVAGV SICSTMTNRQ FHQKLLKSIA 

I I I I I I I I t 1 1 I 

605 615 625 635 645 655 

EMCR NTRNATWIG TTKFYGGHNN MLRTLIDGVE NPMLMGWDYP KCDRALPNMI RMISAMVLGS 

22 9E ATRNATWIG TTKFYGGWDN MLKNLMADVD DPKLMGHDYP KCDRAMPSMI RMLSAMILGS 

PEDV NTRGASWIG TTKFYGGWDN MLKNLIDGVE NPCLMGWDYP KCDRALPNMI RMISAMILGS 

TGEV ATRNATWIG STKFYGGWDN MLKNLMRDVD NGCLMGWDYP KCDRALPNMI RMASAMILGS 

BoCoV ATRGVPVVIG TTKFYGGWDD MLRRLIKDVD NPVLMGWDYP KCDRAMPNIL RIVSSLVLAR 

OC43 ATRGVPWIG TTKFYGGWDD MLRRLIKDVD NPVLMGWDYP KCDRAMPNLL RIVSSLVLAR 

MHV ATRGVPWIG TTKFYGGWDD MLRRLIKDVD SPVLMGWDYP KCDRAMPNIL RIISSLVLAR 

AIPV NTRNASVVIG TTKFYGGWDN MLRNLIQGVE DPILMGWDYP KCDRAMPNLL RIAASLVLAR 

SARS CoV ATRGATWIG TSKFYGGWHN MLKTVYSDVE TPHLMGWDYP KCDRAMPNML RIMASLVLAR 

I I I I I I I I I I 1 I 

665 675 685 695 705 715 

EMCR KHVNCCTVTD RFYRLGNELA QVLTEWYSN GGFYFKPGGT TSGDASTAYA NSIFNIFQAV 

22 9E KHVTCCTASD KFYRLSNELA QVLTEWYSN GGFYFKPGGT TSGDATTAYA NSVFNIFQAV 

PEDV KHTTCCSSTD RFFRLCNELA QVLTEWYSN GGFYLKPGGT TSGDATTAYA NSVFNIFQAV 

TGEV KHVGCCTHND RFYRLSNELA QVLTEVVHCT GGFYFKPGGT TSGDGTTAYA NSAFNIFQAV 

BoCoV KHEACCSQSD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA NSVFNICQAV 

OC43 KHETCCSQSD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA NSVFNICQAV 

MHV KHDSCCSHTD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA NSVFNICQAV 

AIPV KHTNCCSWSE RIYRLYNECA QVLSETVLAT G6IYVKPGGT SSGDATTAYA NSVFNIIQAT 

SARS COV KHNTCCNLSH RFYRLANECA QVLSEMVMCG GSLYVKPGGT SSGDATTAYA NSVFNICQAV 

I 1 I 1 I 1 I I 1 I I I 

725 735 745 755 765 775 

EMCR SSNINRLLSV PSDSCNNVNV RDLQRRLYDN CYRLTSVEES FIDDYYGYLR KHFSMMILSD 

229E SSNINCVLSV NSSNCNNFNV KKLQRQLYDN CYRNSNVDES FVDDFYGYLQ KHFSMMILSD 

PEDV SANVNKLLSV DSNVCHNLEV KQLQRKLYEC CYRSTIVDDQ FWEYYGYLR KHFSMMILSD 

TGEV SANVNKLLGV DSNACNNVTV KSIQRKIYDN CYRSSSIDEE FWBYFSYLR KHFSMMILSD 

BOCOV SANVCALMSC NGNKIEDLSI RALQKRLYSH VYRSDMVDST FVTEYYEFLN KHFSMMILSD 

OC43 SANVCALMSC NGNKIEDLSI RALQKRLYSH VYRSDKVDST FVTEYYEFLN KHFSMMILSD 

MHV SANVCSLMAC NGHKIEDLSI RELQKRLYSN VYRADHVDPA FVNEYYEFLN KHFSMMILSD 

AIPV SANVARLLSV ITRDIVYDNI KSLQYELYQQ VYRRVNFDPA FVEKFYSYLC KNFSLMILSD 

SARS COV TANVNALLST DGNKIADKYV RNLQHRLYEC LYRNRDVDHE FVDEFYAYLR KHFSMMILSD 

SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



54/87 



PCT/NL2004/000805 



I I I I I I 1 I I 1 I I 

785 795 805 815 825 835 

EMCR DGWCYNKDY AELGYIADXS AFKATLYYQN NVFMSTSKCW VEEDLTKGPH EFCSQHTMQI 

229E DSVVCYNKTY AGLGYIADIS AFKATLYYQN GVFMSTAKCW TEEDLSIGPH EFCSQHTMQI 

PEDV DGWCYNNDY ASLGYVADLN AFKAVLYYQN NVFMSASKCW lEPDINKGPH EFCSQHTMQI 

TGEV DGWCYNKDY ADLGYVADIN AFECATLYYQN NVFMSTSKCW VEPDLSVGPH EPCSQHTLQl 

BoCoV DGVVCYNSDY ASKGYIANIS AFQQVLYYQN NVFMSESKCW VENDINNGPH EFCSQHTMLV 

OC43 DGVVCYNSDY ASKGYIANIS AFQQVLYYQN NVFMSESKCW VEHDINNGPH EFCSQHTMLV 

MHV DGVVCYNSEF ASKGYIANIS AFQQVLYYQN NVFMSEAKCW VETDIEKGPH EFCSQHTMLV 

AIPV DGVVCYNNTL AKQGLVADIS GFREVLYYQN NVFMADSKCW VEPDLEKGPH EFCSQHTMLV 

SARS CoV DAVVCYNSNY AAQGLVASIK NFKAVLYYQN NVFMSEAKCW TETDLTKGPH EFCSQHTMLV 

I I I I I I I I I I I I 

845 855 865 875 885 895 

EMCR VDKDGTYYLP YPDPSRILSA GVFVDDWKT DAWLLXRYV SLAIDAYPLS KHPNSEYRKV 

22 9E VDENGKYYLP YPDPSRIISA GVFVDDITKT DAVILLERYV SLAIDAYPLS KHPKPEYRKV 

PEDV VDKEGTYYLP YPDPSRILSA GVFVDDWKT DAVVLLERYV SLAIDAYPLS KHENPEYKKV 

TGEV VGPDGDYYLP YPDPSRILSA GVFVDDIVKT DNVIMLERYV SLAIDAYPLT KHPKPAYQKV 

BoCoV KMDGDDVYLP YPVPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV YHENEEYQKV 

OC43 KMDGDDVYLP YPNPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV YHENEEYQKV 

MHV KMDGDEVYLP YPDPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV YHENPEYQNV 

AIPV EVDGCPKYLP YPDPSRILGA CVFVDDVDKT EPVAVMERYI ALAIDAYPLV HHENEEYKKV 

SARS CoV KQGDDYVYLP YPDPSRILGA GCFVDDIVKT DGTLMIERFV SLAIDAYPLT KHPNQEYADV 

I I I I 1 I 1 I i I I I 

905 915 925 935 945 955 

EMCR FYVLLDWVKH LNKNLNEGVL ESFSVTLLDN QEDKFWCEDF YASMYENSTI LQAAGLCVVC 

22 9E FYALLDWVKH LNKTLNEGVL ESFSVTLLDE HESKFWDESF YASMYEKSTV LQAAGLCWC 

PEDV FYVLLDWVKH LYKTLNAGVL ESFSVTLLED STAKFWDESF YANMYEKSAV LQSAGLCVVC 

TGEV FYTLLDWVKH LQKNLNAGVL DSFSVTMLEE GQDKFWSEEF YASLYEKSTV LQAAGMCWC 

BoCoV FRVYLEYIKK LYNELGNQIL DSYSVILSTC DGQKFTDESF YKNMYLRSAV MQSVGACWC 

OC43 FRVYLAYIKK LYNDLGNQIL DSYSVILSTC DGQKFTDESF YKNMYLRSAV MQSVGACWC 

MHV FRVYLEYIKK LYNDLGNQIL DSYSVILSTC DGQKFTDBTF YKNMYLRSAV MQSVGACWC 

AIPV FFVLLAYIRK LYQELSQNML MDYSFVMDID KGSKFWEQEF YENMYRAPTT LQSCGVCWC 

SARS CoV FHLYLQYIRK LHDELTGHML DMYSVMLTND NTSRYWEPEF YEAMYTPHTV LQAVGACVLC 

I I I I 1 I I I I I I I 

965 975 985 995 1005 1015 

EMCR GSQTVLRCGD CLRKPMLCTK CAYDHVFGTD HKFILAITFY VCNASGC6VS DVKKLYLGGL 

229E GSQTVLRCGD CLRRPMLCTK CAYDHVFGTD HKFILAITPY VCNTSGCNVN DVTKLYLGGL 

PEDV GSQTVLRCGD CLRRPMLCTK CAYDHVIGTT HKFILAITPY VCCASECGVN DVTKLYLGGL 

TGEV GSQTVLRCGD CLRRPLLCTK CAYDHVMGTK HKFIMSITPY VCSFNGCNVN DVTKLFLGGL 

BoCoV SSQTSLRCGS CIRKPLLCCK CCYDHVMATD HKYVLSVSPY VCNAPGCDVN DVTKLYLGGM 

OC4 3 SSQTSLRCGS CIRKPLLCCK CCYDHVMATD HKYVLSVSPY VCNAPGCDVN DVTKLYLGGM 

MHV SSQTSLRCGS CIRKPLLCCK CAYDHVMSTD HKYVLSVSPY VCNSPGCDVN DVTKLYLGGM 

AIPV NSQTILRCGN CIRKPFLCCK CCYDHVMHTD HKNVLSINPY ICSQLGCGEA DVTKLYLGGM 

SARS COV NSQTSLRCGA CIRRPFLCCK CCYDHVISTS HKLVLSVNPY VCNAPGCDVT DVTQLYLGGM 

I I I I I I ....I I I I I I 

1025 1035 1045 1055 1065 1075 

EMCR NYYCTNHKPQ LSFPLCSAGN IFGLYKNSAT GSLDVEVFNR LATSDWTDVR DYKLANDVKD 

229E NYYCVDHKPH LSFPLCSAGN VFGLYKSSAL GSMDIDVFNK LSTSDWSDIR DYKLANDAKE 

PEDV SYWCHEHKPR LAFPLCSAGN VFGLYKNSAT GSPDVEDFNR lATSDWTDVS DYKLANDVKD 

TGEV SYYCMNHKPQ LSFPLCANGN VFGLYKSSAV GSEAVEDFNK LAVSDWTNVE DYKLANNVKE 

BoCoV SYYCEDHKPQ YSFKLVMNGM VFGLYKQSCT GSPYIDDFNR lASCKWTDVD DYILANECTE 

OC43 SYYCEDHKPQ YSFKLVMNGL VFGLYKQSCT GSPYIDDFNR lASCKWTDVD DYILANECTE 

MHV SYYCEDHKPQ YSFKLVMNGM VFGLYKQSCT GSPYIEDFNK lASCKWTEVD DYVLANECTE 

AIPV SYFCGNHKPK LSIPLVSNGT VFGIYRAMCA GSENVDDFNQ LATTNWSIVE PYILANRCSD 

SARS CoV SYYCKSHKPP ISFPLCANGQ VFGLYKNTCV GSONVTDFNA lATCDWTNAG DYILANTCTE 

I I I I I I I I I I I I 

1085 1095 1105 1115 1125 1135 

EMCR TLRLFAAETI KAKEESVKSS YAFATLKEVV GPKELLLSWE SGKVKPPLNR NSVFTCFQIS 

22 9E SLRLFAAETV KAKEESVKSS YAYATLKEIV GPKELLLLWE SGKAKPPLNR NSVFTCFQIT 

PEDV SLRLFAAETI KAKEESVKSS YACATLHEVV GPKELLLKWE VGRPKPPLNR NSVFTCYHIT 

TGEV SLKIFAAETV KAKEESVKSE YAYAVLKEVI GPKEIVLQWE ASKTKPPLNR NSVFTCFQIS 

BoCoV RLKLFAAETQ KATEEAFKQS YASATIQEIV SERELILSWE IGKVKPPLNK NYVFTGYHFT 

OC43 RLKLFAAETQ KATEEAFKQS YASATIQEIV SERELILSWE IGKVKPPLNK NYVFTGYHFT 

MHV RLKLFAAETQ KATEESFKQC YASATIREIV SDRELILSWE IGKVRPPLNK NYVFTGYHFT 

AIPV SLRRFAAETV KATEELHKQQ FASAEVREVF SDRELILSWE PGKTRPPLNR NYVFTGYHFT 

SARS CoV RLKLFAAETL KATEETFKLS YGIATVREVL SDRELHLSWE VGKPRPPLNR NYVFTGYRVT 

I I 1 I ! I I 1 I 1 I I 

1145 1155 1165 1175 1185 1195 

EMCR KDSKFQIGEF IFEKVEYGSD TVTYKSTVTT KLVPGMIFVL TSHNVQPLRA PTIANQEKYS 

229E KDSKFQVGEF VFEKVDYGSD TVTYKSTATT KLVPGMLFIL TSHNVAPLRA PTMANQEKYS 

PEDV KNTKFQIGEF VFEKAEYDND AVTYKTTATT KLVPGMVFVL TSHNVQPLRA PTIANQERYS 

TGEV KDTKIQLGEF VFEQSEYGSD SVYYKSTSTY KLTPGMIFVL TSHNVSPLKA PILVNQEKYN 

BoCoV KNGKTVLGEY VFDKSEL-TN GVYYRATTTY KLSVGDVFVL TSHSVANLSA PTLVPQENYS 

OC43 KNGKTVLGEY VFDKSEL-TN GVYYRATTTY KLSVGDVFVL TSHSVANLSA PTLVPQENYS 

MHV SNGKTVLGEY VFDKSEL-TN GVYYRATTTY KLSVGDVFIL TSHAVSSLSA PTLVPQENYT 

AIPV RTSKVQLGDF TFEKGEG-KD VVYYKATSTA KLSVGDIFVL TSHNWSLVA PTLCPQQTFS 

SARS CoV KNSKVQIGEY TFEKGDY-GD AVVYRGTTTY KLNVGDYFVL TSHTVMPLSA PTLVPQEHYV 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



55/87 



PCT/NL2004/000805 



I I I. ...I I I I I I I 1 I 

1205 1215 1225 1235 1245 1255 

EMCR SIYECLHPAFN VSDAYANLVP YYQLIGKQKI TTIQGPPGSG KSHCSIGLGL YYPGARIVFV 

229E TIYKLHPSFN VSDAYANLVP YYQLIGKQRI TTIQGPPGSG KSHCSIGIGV YYPGARIVFT 

PEDV TIHKLHPAFN IPEAYSSLVP YYQLIGKQKI TTIQGPPGSG KSHCVIGLGL YYPGARIVFT 

TGBV TISKLYPVFN lAEAYNTLVP YYQMIGKQKF TTIQGPPGSG KSHCVIGLGL YYPQARIVYT 

BoCoV SIR-FASVYS VLETFQNNVV NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV YYCTARWYT 

OC43 SIR-FASVYS VLETFQNNVV NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV FYCTARWYT 

MHV SIR-FASVYS VPETFQNNVP NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV YYCTARWYT 

AIPV RFVNLRPNVM VPECFVNNIP LYHLVGKQKR TTVQGPPGSG KSHFAIGLAV YFSSARVVFT 

SARS COV RITGLYPTLN ISDEFSSNVA NYQKVGMQKY STLQGPPGTG KSHFAIGLAL YYPSARIVYT 

I I I I I I I i 1 I I I 

1265 1275 1285 1295 1305 1315 

EMCR ACAHAAVDSL CAKAMTVYSI DKCTRIIPAR ARVECYSGFK PNNTSAQYIF STVNALPECN 

229E ACSHAAVDSL CAKAVTAYSV DKCTRIIPAR ARVECYSGFK PNNNSAQYVF STVNALPEVN 

PEDV ACSHAAVDSL CVKASTAYSN DKCSRIIPQR ARVECYDGFK SNNTSAQYLF STVNALPECN 

TGEV ACSHAAVDAL CEKAAKNFNV DRCSRIIPQR IRVDCYTGFK PNNTNAQYLF CTVNALPEAS 

BoCoV AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVECYDKFK INDTTRKYVF TTINALPEMV 

OC43 AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVECYDKFK INDTTRKYVF TTINALPEMV 

MHV AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVDCYDKFK VNDTTRKYVF TTINALPELV 

AIPV ACSHAAVDAL CEKAFKFLKV DDCTRIVPQR TTVDCFSKFK ANDTGKKYIF STINALPEVS 

SARS CoV ACSHAAVDAL CEKALKYLPI DKCSRIIPAR ARVECFDKFK VNSTLEQYVF CTVNALPETT 

I 1 I I I 1 1 I I 1 I I 

1325 1335 1345 1355 1365 1375 

EMCR ADIWVDEVS MCTNYDLSVI NQRLSYKHIV YVGDPQQLPA PRVMITKGVM EPVDYNVVTQ 

229E ADIWVDEVS MCTNYDLSVI NQRISYKHIV YVGDPQQLPA PRVLISKGVM EPIDYNWTQ 

PEDV ADIWVDEVS MCTNYDLSVI NQRISYRHW YVGDPQQLPA PRVMISRGTL EPKDYNVVTQ 

TGEV CDIVWDEVS MCTNYDLSVI NSRLSYKHIV YVGDPQQLPA PRTLINKGVL QPQDYNWTK 

BoCoV TDIVWDEVS MLTNYELSVI NARIRAKHYV YIGDPAQLPA PRVLLSKGTL EPKYFNTVTK 

OC43 TDIVWDEVS MLTNYELSVI NARIRAKHYV YIGDPAQLPA PRVLLSKGTL EPKYFNTVTK 

MHV TDIIWDEVS MLTNYELSVI NSRVRAKHYV YIGDPAQLPA PRVLLNKGTL EPRYFNSVTK 

AIPV CDILLVDEVS MLTNYELSFI NGKINYQYW YVGDPAQLPA PRTLLN-GSL SPKDYNVVTN 

SARS CoV ADIWFDEIS MATNYDLSW NARLRAKHYV YIGDPAQLPA PRTLLTKGTL EPEYFNSVCR 

I I I I I I 1 I I I I I 

1385 1395 1405 1415 1425 1435 

EMCR RMCAIGPDVF LHKCYRCPAE IVNTVSELVY ENKFVPVKPA SKQCFKIFFK G NVQVDN 

22 9E RMCAIGPDVF LHKCYRCPAE IVNTVSELVY ENKFVPVKEA SKQCFKIFER G SVQVDN 

PEDV RMCALKPDVF LHKCYRCPAE IVRTVSEMVY ENQFIPVHPD SKQCFKIFCK G NVQVDN 

TGEV RMCTLGPDVF LHKCYRCPAE IVKTVSALVY ENKFVPVNPE SKQCFKMFVK G QVQIES 

BoCoV LMCCLGPDIF LGTCYRCPKE IVDTVSALVY ENKLKAKNES SSLCFKVYYK G VTTHES 

OC43 LMCCLGPDIF LGTCYRCPKE IVDTVSALVY ENKLKAKNES SSLCFKVYYK G VTTHES 

MHV LMCCLGPDIF LGTCYRCPKE IVDTVSALVY HNKLKAKNDN SSMCFKVYYK G QTTHES 

AIPV LMVCVKPDIF LAKCYRCPKE IVDTVSTLVY DGKFIANNPE SRECFKVIVN NGNSDVGHES 
SARS CoV LMKTIGPDMF LGTCRRCPAE IVDTVSALVY DNKLKAHKDK SAQCFKMFYK G VITHDV 

I I I I I I I I I I I I 

1445 1455 1465 1475 1485 1495 

EMCR GSSINRKQLE IVKLFLVKNP SWSKAVFISP YNSQNYVASR FLGLQIQTVD SSQGSEYDYV 

229E GSSINRRQLD WKRFIHKNS TWSKAVFISP YNSQNYVAAR LLGLQTQTVD SAQGSEYDYV 

PEDV GSSINRRQLD VVRMFLAKNP RWSKAVFISP YNSQNYVASR LLGLQIQTVD SSQGSEYDYV 

TGEV NSSINNKQLE WKAFLAHNP KWRKAVFISP YNSQNYVARR LLGLQTQTVD SAQGSEYDYV 

BoCoV SSAVNMQQIY LINKFLKANP LWHKAVFISP YNSQNFAAKR VLGLQTQTVD SAQGSEYDYV 

OC43 SSAVNMQQIY LINKFLKANP LWHKAVFISP YNSQNFAAKR VLGLQTQTVD SAQGSEYDYV 

MHV SSAVNMQQIY LISKFLKANP SWSNAVFISP YNSQNYVAKR VLGLQTQTVD SAQGSEYDFV 

AIPV GSAYNTTQLE FVKDFVCRNK QWREAIFISP YNAMNQRAYR MLGLNVQTVD SSQGSEYDYV 

SARS CoV SSAINRPQIG WREFLTRNP AHRKAVFISP YNSQNAVASR ILGLPTQTVD SSQGSEYDYV 

1 I I I 1 I I I I I I I 

1505 1515 1525 1535 1545 1555 

EMCR lYAQTSDTAH ACNVNRFNVA ITRAKKGIFC VMCDKT-LFD SLKFFEIKHA — DLHSS 

229E IFAQTSDTAH ACNANRFNVA ITRAKKGIFC IMSDRT-LFD ALKFFEITMT — DLQSE 

PEDV lYAQTSDTAH ASNVNRFNVA ITRAKKGILC IMCDRS-LFD LLKFFELKLS —DLQAN 

TGEV lYTQTSDTQH ATNVNRFNVA ITRAKVGILC IMCDRT-MYE NLDFYELKDS KIGLQAKP — 

BoCoV lYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSNMQ-LFE ALQFTTLTVD KVPQAVETRV 

OC43 lYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSNMQ-LFE ALQFTTLTLD KVPQAVETKV 

MHV lYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSSMQ-LFE SLNFSTLTLD KIN NPRL 

AIPV IFCVTADSQH ALNINRFNVA LTRAKRGILV VMRQRDELYS ALKFTELDSE TSLQG 

SARS CoV IFTQTTETAH SCNVNRFNVA ITRAKIGILC IMSDRD-LYD KLQFTSLEIP RRN-VATLQA 

I I I I I I I I I I 1 I 

1565 1575 1585 1595 1605 1615 

EMCR -QVCGLFKNC TRTPLNLPPT HAHTFLSLSD QFKTTGDLAV QIGSNN— VC TYEHVISFMG 

229E -SSCGLFKDC ARNPIDLPPS HATTYLSLSD RFKTSGDLAV QIGNNN — VC TYEHVISYMG 

PEDV -EGCGLFKDC SRGDDLLPPS HANTFMSLAD NFKTDQYLAV QIGVNG — PI KYEHVISFMG 

TGEV -ETCGLFKDC SKSEQYIPPA YATTYMSLSD NFKTSDGLAV NIG-TK — DV KYANVISYMG 

BoCoV QCSTNLFKDC SKSYSGYHPA HAPSFLAVDD KYKATGDLAV CLGIGD-SAV TYSRLISLMG 

OC43 QCSTNLFKDC SKSYSGYHPA HAPSFLAVDD KYKATGDLAV CLGIGD-SAV TYSRLISLMG 

MHV QCTTNLFKDC SRSYAGYHPA HAPSFLAVDD KYKVGGDLAV CLNVAD-SAV TYSRLISLMG 

A-I^V TGLFKIC NKEFSGVHPA YAVTTKALAA TYKVNDELAA LVNVEAGSEI TYKHLISLLG 

S^^RS COV ENVTGLFKDC SKIITGLHPT QAPTHLSVOI KFKTEG-LCV DIPGIP-KDM TYRRLISMMG 
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I I 1 I I I I I I I I I 

1625 1635 1645 1655 1665 1675 

EMCR FRFDISIPGS HSLFCTRDFA IRNVRGWLGM DVESAHVCGD NIGTNVPLQV GFSNGVNFVV 

22 9E FRFDVSMPGS HSLFCTRDFA MRHVRGWLGM DVEGAHVTGD NVGTNVPLQV GFSNGVDFVA 

PEDV FRFDINIPNH HTLFCTRDFA MRNVRGWLGF DVEGAHVVGS NVGTNVPLQL GFSNGVDFVV 

TGEV FRFEANIPGY HTLFCTRDFA MRNVRAWLGF DVEGAHVCGD NVGTNVPLQL GFSNGVDFVV 

BOCOV FKLDVTLDGY CKLFITKEEA VKRVRAWVGF DAEGAHATRD SIGTNFPLQL GFSTGIDFVV 

OC43 FKLDVTLDGY CKLFITKEEA VKRVRAWVGF DAEGAHATRD SIGTNFPLQL GFSTGIDFVV 

MHV FKLDLTLDGY CKLFITRDEA IRRVRAWVGF DAEGAHATRD SIGTNFPLQL GFSTGIDFVV 

AIPV FKMSVNVEGC HNMFITRDEA IRNVRGWVGF DVEATHACGT NIGTNLPFQV GFSTGADFW 

SARS COV FKMNYQVNGY PNMFITREEA IRHVRAWIGF DVEGCHATRD AVGTNLPLQL GFSTGVNLVA 

I I I I I I I I I I I I 

1685 1695 1705 1715 1725 1735 

EMCR QTEGCVSTNF GDVIKPVCAK SPPGEQFRHL VPFLRKGQPW LIVRRRIVQM ISDYLSNLSD 

229E QPEGCVLTNT GSVVKPVRAR APPGEQFTHI VPLLRKGQPW SVLRKRIVQM lADFLAGSSD 

PEDV RPEGCWTES GDYIKPVRAR APPGEQFAHL LPLLKRGQPW DWRKRIVQM CSDYLANLSD 

TGEV QTE6CVITEK GNSIEWKAR APPGEQFAHL IPLMRKGQPW HIVRRRIVQM VCDYFDGLSD 

BoCoV EATGLFADRD GYSFKKAVAK APPGEQFKHL IPLMTRGQRW DVVRPRIVQM FADHLIDLSD 

OC43 EATGLFADRD GYSFKKAVAK APPGEQFKHL IPLMTRGHRW DVVRPRIVQM FADHLIDLSD 

MHV EATGMFAERD GYVFKKAVAR APPGEQFKHL VPLMSRGQKW DWRIRIVQM LSDHLVDLAD 

AIPV TPEGLVDTSI GNNFEPVNSK APPGEQFKHL RVLFKSAKPW HVIRPRIVQM LADNLCNVSD 

SARS CoV VPTGYVOTEN NTEFTRVNAK PPPGDQFKHL IPLMYKGLPW NWRIKIVQM LSDTLKGLSD 

I I I I I I I I I I I I 

1745 1755 1765 1775 1785 1795 

EMCR ILVFVLWAGS LELTTMRYFV KIGPIKYCY- CGNSATCYNS VSNEYCCFKH ALGCDYVYNP 

229E VLVFVLWAGG LELTTMRYFV KIGAVKHCQ- CGTVATCYNS VSNDYCCFBCH ALGCDYVYNP 

PEDV ILIFVLWAGG LELTTMRYFV KIGPSKSCD- CGKVATCYNS ALHTYCCFKH ALGCDYLYNP 

TGEV ILIFVLWAGG LELTTMRYFV KIGRPQKCE- CGKSATCYSS SQSVYACFKH ALGCDYLYNP 

BoCoV CWLVTWAAN FELTCLRYFA KVGREISCNV STKRATAYNS RTGYYGCWRH SVTCDYLYNP 

OC43 CWLVTWAAN FELTCLRYFA KVGREISCNV CTKRATVYNS RTGYYGCWRH SVTCDYLYNP 

MHV SVVLVTWAAS FELTCLRYFA KVGKEVVCSV CNKRATCFNS RTGYYGCWRH SYSCDYLYNP 

AIPV CWFVTWCHG LELTTLRYFV KIGKEQVCS- CGSRATTFNS HTQAYACWKH CLGFDFVYNP 

SARS CoV RWFVLWAHG FELTSMKYFV KIGPERTCCL CDKRATCFST SSDTYACWNH SVGFDYVYNP 

I I I I 1 I I I I I I I 

1805 1815 1825 1835 1845 1855 

EMCR YAFDIQQWGY VGSLSQNHHT FCNIHRNEHD ASGDAVMTRC LAVHDCFVKN VDWTVTYPFI 

229E YVIDIQQWGY VGSLSTNHHA ICNVHRNEHV ASGDAIMTRC LAVYDCFVKN VDWSITYPMI 

PEDV YCIDIQQWGY KGSLSLNHHE HCNVHRNEHV ASGDAIMTRC LAIHDCFVKN VDWSITYPFI 

TGEV YCIDIQQWGY TGSLSMNHHE VCNIHRNEHV ASGDAIMTRC LAIHDCFVKR VDWSIVYPFI 

BoCoV LIVDIQQWGY IGSLSSNHDL YCSVHKGAHV ASSDAIMTRC LAVYDCFCNN INWNVEYPII 

OC43 LIVDIQQWGY IGSLSSNHDL YCSVHKGAHV ASSDAIMTRC LAVYDCFCNN INWNVEYPII 

MHV LIVDIQQWGY TGSLTSNHDL ICSVHKGAHV ASSDAIMTRC LAVHDCFCKS VNWSLEYPII 

AIPV LLVDIQQWGY SGNLQFNHDL HCNVHGHAHV ASVDAIMTRC LAINNAFCQD VNWDLTYPHI 

SARS CoV FMIDVQQWGF TGNLQSNHDQ HCQVHGNAHV ASGDAIMTRC LAVHECFVKR VDWSVEYPII 

I 1 I I I I I I I I I I 

1865 1875 1885 1895 1905 1915 

EMCR ANEKFINGCG RNVQGHWRA ALKLYKPSVI HDIGNPKGVR CA-VTDAKWY CYDKQPVNSN 

229E ANENAINKGG RTVQSHIMRA AIKLYNPKAI HDIGNPKGIR CA-VTDAKWY CYDKNPINSN 

PEDV GNEAVINKSG RIVQSHTMRS VLKLYNPKAI YDIGNPKGIR CA-VTDAKWF CFDKNPTNSN 

TGEV DNEEKINKAG RIVQSHVMKA ALKIFNPAAI HDVGNPKGIR CA-TTPIPWF CYDRDPINNN 

BoCoV SNELSINTSC RVLQRVMLKA AMLCNRYTLC YDIGNPKAIA CV — KDFDFK FYDAQPIVKS 

OC43 SNELSINTSC RVLQRVILKA AMLCNRYTLC YDIGNPKAIA CV — KDFDFK FYDAQPIVKS 

MHV SNEVSVNTSC RLLQRVMFRA AMLCNRYDVC YDIGNPKGLA CV — KGYDFK FYDASPVVKS 

AIPV ANEDEVNSSC RYLQRMYLNA CVDALKVNVV YDIGNPKGIK CVRRGDVNFR FYDKNPIVRN 

SARS CoV GDELRVNSAC RKVQHMWKS ALLADKFPVL HDIGNPKAIK CVPQAEVEWK FYDAQPCSDK 

I I I I I I I I I I I I 

1925 1935 1945 1955 1965 1975 

EMCR VKLLDYD YATHG — QLD GLCLFWNCNV DMYPEFSIVC RFDTRTRSVF NLEGVNGGSL 

22 9E VKTLEYD YMTHG — QMD GLCLFWNCNV DMYPEFSIVC RFDTRTRSTL NLEGVNGGSL 

PEDV VKTLEYD YITHG — QFD GLCLFWNCNV DMYPEFSVVC RFDTRCRSPL NLEGCNGGSL 

TGEV VRCLDYD YMVHG— QMN GLMLFWNCNV DMYPEFSIVC RFDTRTRSKL SLEGCNGGAL 

BOCOV VKTLLYF FEAHKDSFKD GLCMFWNCNV DKYPPNAVVC RFDTRVLNNL NLPGCNGGSL 

OC43 VKTLLYS FEAHKDSFKD GLCMFWNCNV DKYPPNAVVC RFDTRVLNNL NLPGCNGGSL 

MHV VKQFVYK YEAHKDQFLD GLCMFWNCNV DKYPANAVVC RFDTRVLNKL NLPGCNGGSL 

AIPV VKQFEYD YNQHKDKFAD GLCMFWNCNV DCYPDNSLVC RYDTRNLSVF NLPGCNGGSL 

SARS COV AYKIBELFYS YATHHDKFTD GVCLFWNCNV DRYPANAIVC RFDTRVLSNL NLPGCDGGSL 

1 I 1 I I I I I I I 1 I 

1985 1995 2005 2015 2025 2035 

EMCR YVNKHAFHTP AYDKRAFVKL KPMPFFYFDD SDCDVVQ -EQVNYVPLR ASSCVTRCNI 

229E YVNKHAFHTP AYDKRAMAKL KPAPFFYYDD GSCEVVH -DQVNYVPLR ATNCITKCNI 

PEDV YVNNHAFHTP AFDKRAFAKL KPMPFFFYDD TECDKLQ -DSINYVPLR ASNCITKCNV 

TGEV YVNNHAFHTP AYDRRAFAKL KPMPFFYYDD SNCELVD -GQPNYVPLK SNVCITKCNI 

BoCoV YVNKHAFHTK PFSRAAFEHL KPMPFFYYSD TPCVYMDGMD AKQVDYVPLK SATCITRCNL 

OC43 YVNKHAFHTK PFARAAFEHL KPMPFFYYSD TPCVYMDGMD AKQVDYVPLK SATCITRCNL 

MHV YVNKHAFHTS PFTRAAFENL KPMPFFYYSD TPCVYMEGME SKQVDYVPLR SATCITRCNL 

AIPV YVNKHAFYTP KFDRISFRNL KAMPFFFYDS SPCETIQ-VD GVAQDLVSLA TKDCITKCNI 

SARS CoV YVNKHAFHTP AFDKSAFTNL KQLPFFYYSD SPCESHGKQV VSDIDYVPLK SATCITRCNL 
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I I I I I I I I I I I I 

2045 2055 2065 2075 2085 2095 

EMCR GGAVCSKHAN LYQKYVEAYN TFTQAGFNIW VPHSFDVYNL WQIFIET-NL QSLENIAFNV 

22 9E GGAVCSKHAN LYRAYVESYN IFTQAGFNIW VPTTFDCYNL WQTFTEV-NL QGLENIAFNV 

PEDV GGAVCSKHCA MYHSYVNAYN TFTSAGFTIW VPTSFDTYNL WQTFSN— NL QGLENIAFNV 

TGEV GGAVCKKHAA LYRAYVEDYN IFMQAGFTIW CPQNFDTYML WHGFVNSKAL QSLENVAFNV 

BoCoV GGAVCIiKHAE EYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTK L QSLENWYNL 

OC43 GGAVCLKHAE EYE^YLESYN TATTAGFTFW VYKTFDFYNL WNTFTK L QSLENWYNL 

MHV GGAVCLKHAE DYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTR L QSLENWYNL 

AIPV GGAVCKKHAQ MYAEFVTSYN AAVTAGFTFW VTNKLNPYNL WKSFSA L QSIDNIAYNM 

SARS CoV GGAVCEmHAN EYRQYLDAYN MMISAGFSLW lYKQFDTYNL WNTFTR L QSLENVAYNV 



I I I I I I I I I I I I 

2105 2115 2125 2135 2145 2155 

EMCR VKKGCFTGVD GELPVAWND KVFVRYGDVD NLVFTNKTTL PTNVAFELFA KRKMGLTPPL 

229E VNKGSFVGAD GELPVAISGD KVFVRDGNTD NLVFVNKTSL PTNIAFELFA KRKVGLTPPL 

PEDV LKKGSFVGDE GELPVAWND KVLVRDGTVD TLVFTNKTSL PTNVAFELYA KRKVGLTPPI 

TGEV VKKGAFTGLK GDLPTAVIAD KIMVRDGPTD KCIFTNKTSL PTNVAFELYA KRKLGLTPPL 

BoCoV VKTGHYTGQA GEMPCAIIND KVVAKIDKED VVIFINNTTY PTNVAVELFA KRSIEUIHPEL 

OC43 VKTGHYTGQA GEMPCAIIND KVVAKIDKED VVIFINNTTY PTNVAVELFA KRSVRHHPEL 

MHV VNAGHFDGRA GELPCAVIGE KVIAKIQNED VWFKNNTPF PTNVAVELFA KRSIRPHPEL 

AIPV YKGGHYDAIA GEMPTVITGD KVFVIDQGVE iCAVFVNQTTL PTSVAFELYA KRNIRTLPNN 

SARS CoV VNKGHFDGHA GEAPVSIINN AVYTKVDGID VEIFENKTTL PVNVAFELWA KRNIKPVPEI 



I I I I I.... I I I I I I I 

2165 2175 2185 2195 2205 2215 

EMCR SILKNLGVVA TYKFVLWDYE AERPFTSYTK SVCKYTDFN- EDV CVCFDNSIQG 

22 9E SILKNLGWA TYKFVLWDYE AERPLTSFTK SVCGYTDFA- EDV CTCYDNSIQG 

PEDV TILRNLGVVC TSKCVIWDYE AERPLTTFTK DVCKYTDFE- GDV CTLFDNSIVG 

TGEV TILRNLGVVA TYKFVLWDYE AERPFSNFTK QVCSYTDLD- SEV VTCFDNSIAG 

BoCoV KLFRNLNIDV CWKHVIWDYA RESIFCSNTY GVCMYTDLK- LIDKL NVLFDGRDNG 

OC43 KLFRNLNIDV CWKHVIWDYA RESIFCSNTY GVCMYTDLK FIDKL NVLFDGRDNG 

MHV KLFRNLNIDV CWSHVLWDYA KDSVFCSSTY KVCKYTDLQ CIESL NVLFDGRDNG 

AIPV RILKGLGVDV TNGFVIWDYA NQTPLYRNTV KVCAYTDIE PNGL WLYDDR-YG 



SARS CoV KILNNLGVDI AANTVIWDYK REAPAHVSTI GVCTMTDIAK KPTESACSSL TVLFDGRVEG 



I I I I I 1 I I I I ! I 

2225 2235 2245 2255 2265 2275 

EMCR SYERFTLTTN AVLFSTVVIK N LTPIK LNFGMLNGMP VSSIKSDKGV EKLVNWYTYV 

229E SYERFTLSTN AVLFSATAVK TGGKSLPAIK LNFGMLNGNA lATVKSEDGN IKNINWFVYV 

PEDV SLERFSMTQN AVLMSLTAVK K LTGIK LTYGYLNGVP VNTHED KPFTWYIYT 

TGEV SFERFTTTRD AVLISNNAVK G LSAIK LQYGLLNDLP VSTVGN -KPVTWYIYV 

BoCoV ALEAFKRSNN GVYISTTKVK S LSMIR GPPRAELNGV VVDKVGD -TDCVFYFAV 

OC43 ALEAFKRSNN GVYISTTKVK S LSMIR GPPRAELNGV WDKVGD -TDCVFYFAV 

MHV ALEAFKKCRD GVYINTTKIK S LSMIK GPQRADLNGV WEKVGD -SDVEFWFAM 

AIPV DYQSFLAADN AVLVSTQCYK R YSYVE IPSNLLVQNG MPLKDG ANLYVYK 

SARS CoV QVDLFRNARN GVLITEGSVK G LTPSK GPAQASVNGV TLIGES -VKTQFNYFK 



I. 



I 



EMCR 

229E 

PEDV 

TGEV 

BoCoV 

OC43 

MHV 

AIPV 

SARS CoV 



2285 

RKNG 

RKDG 

RKNG 

RKNG 



2295 

QFQDH 

KPVDH 

KFEDY 

EYVEQ 

RKEGQDVIFS QFDSLRVSSN 
RKEGQDVIFS QFDSLGVSSN 
RRDGDDVIFS RTGSLEPSHY 

RVNG AFVTL 

KVDG IIQQL 



2305 



2315 



..I I 

2325 
— DGFYTQ 
— DGFYTQ 
— DGYFTQ 
— DSYYTQ 



Y 

Y 

P 

I 

QSPQGNLGSN -EPGNVGGND ALATSTIFTQ 
QSPQGNLGSN GKPGNVGGND ALSISTIFTQ 
RSPQGNPGGN -RVGDLSGNE ALARGTIFTQ 

p NTINTQ 

p ETYFTQ 



1 I 

2335 
GRNLSDFTPR 
GRNLQDFLPR 
GRTTADFSPR 
GRTFETFKPR 
SRVISSFTCR 
SRVISSFTCR 
SRFLSSFAPR 
GRSYETFEPR 
SRDLEDFKPR 



I 1 I I i I 1 1 1 I I I 

2345 2355 2365 2375 2385 2395 

EMCR SDMEYDFLNM DMGVFINKYG LEDFNFEHVV YGDVSKTTLG GLHLLISQFR LSKMGVLKAD 

229E STMEEDFLNM DIGVFIQKYG LEDFNFEHVV YGDVSKTTLG GLHLLISQVR LSKMGILKAE 

PEDV SDMEKDFLSM DMGLFINKYG LEDYGFEHVV YGDVSKTTLG GLHLLISQVR LACMGVLKID 

TGEV STMEEDFLSM DTTLFIQKYG LEDYGFEHVV FGDVSKTTIG GMHLLISQVR LAKMGLFSVQ 

BoCoV TDMEKDFIAL DQDVFIQKYG LEDYAFEHIV YGNFNQKIIG GLHLLIGLYR RQQTSNLVIQ 

OC43 TDMEKDFIAL DQDVFIQKYG LEDYAFEHIV YGNFNQKIIG GLHLLIGLYR RQQTSNLWQ 

MHV SEMEKDFMDL DEDVFIAKYS LQDYAFEHVV YGSFNQKIIG GLHLLIGLAR RQQKSNLVIQ 

AIPV SDIBROFLAH SEESFVERYG -KDLGLQHIL YGEVDKPQLG GLHTVIQ1YR LLRANKLNAK 

SARS CoV SQMETDFLEL AMDEFIQRYK LEGYAFEHIV YGDFSHGQLG GLHLMIGLAK R5QDSPLKLE 



I I I I I I I 1 1 I I I 

2405 2415 2425 2435 2445 2455 

EMCR DFVTASDTTL RCCTVTYLNE LSSKVVCTYM DLLLDDFVTI LK SLDLG VISKVHEVII 

229E EFVAASDITL KCCTVTYLND PSSKTVCTYM DLLLDDFVSV LK SLDLT WSKVHEVII 

PEDV EFVSSNDSTL KSCTVTYADN PSSKMVCTYM DLLLDDFVSI LK SLDLS WSKVHEVHV 

TGEV EFMNNSDSTL KSCCITYADD PSSKNVCTYM DILLDDFVTI IK SLDLN WSKWDVIV 

BoCoV EFVS-YDSSI HSYFITDEKS GGSKSVCTVI DILLDDFVAL VK SLNLN CVSKVVNVNV 

OC43 EFVS-YDSSI HSYFITDEKS GGSKSVCTVI DILLDDFVAL VK SLNLN CVSKVVNVNV 

MHV EFVP-YDSSI HSYFITDENS GSSKSVCTVI DLLLDDFVDI VK SLNLN CVSKVVNVNV 

AIPV SVTN-SDSDV MQNYFVLSDN GSYKQVCTVV DLLLDDFLEL LRNILKEYGT NKSKWTVSI 

SARS CoV DFIP-MDSTV KNYFITDAQT GSSKCVCSVI DLLLDDFVEI IK SQDLS VISKVVKVTI 
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I I I I I I I I I I I I 

2465 2475 2485 2495 2505 2515 

EMCR DNKPYRWMLW CKDNHLSTFY PQLQS-AEWK CGYAMPQIYK LQRMCLEPCN LYNYGAGIKL 

229E DNKPWRWMLW CKDNAVATFY PQLQS-AEWK CGYSMPGIYK TQRMCLEPCN LYNYGAGLKL 

PEDV DCKMWRWMLW CKDHKLQTFY PQLQA-SEWK CGYSMPSIYK IQRMCLEPCN LYNYGAGVKL 

TGEV DCKAWRWMLW CENSHIKTFY PQLQS-AEWN PGYSMPTLYK IQRMCLERCN LYNYGAQVKL 

BOCOV DFKDFQFMLW CNDEKVMTFY PRLQAASDWK PGYSMPVLYK YLNSPMERVS LWNYGKPVTL 

OC43 DFKDFQFMLW CNDEKVMTFY PRLQAASDWK PGYSMPVLYK YLNSPMERVS LWNYGKPVTL 

MHV DFKDFQFMLW CNEEKVMTFY PRLQAAADWK PGYVMPVLYK YLESPLERVN LWNYGKPITL 

AIPV DYHSINFMTW FEDGSIKTCY PQLQS— AWT CGYNMPELYK VQNCVMEPCN IPNYGVGITL 

SARS CoV DYAEISFMLW CKDGHVETFY PKLQASQAWQ PGVAMPNLYK MQRMLLEKCD LQNYGENAVI 

I I I I ....I I I I I I I I 

2525 2535 2545 2555 2565 2575 

EMCR PSGIMLNVVK YTQLCQYLNS TTMCVPHNMR VLHYGAGSDK GVAPGTTVLK RWLPPD 

229E PSGIMFNVVK YTQLCQYFNS TTLCVPHNMR VLHLGAGSDY GVAPGTAVLK RWLPHD 

PEDV PDGIMFNVVK YTQLCQYLNS TTMCVPHHMR VLHLGAGSDK GVAPGTAVLR RWLPLD 

TGEV PDGITTNVVK YTQLCQYLNT TTLCVPHKMR VLHLGAAGAS GVAPGSTVLR RWLPDD 

BoCOV PTGCMMNVAK YTQLCQYLNT TTLAVPVNTR VLHLGAGSEK GVAPGSAVLR QWLPAGTILR 

OC43 PTGCMMNVAK YTQLCQYLNT TTLAVPVNMR VLHLGAGSEK GVAPGSAVLR QWLPAG 

MHV PTGCLMNVAK YTQLCQYLNT TTLAVPANMR VLHLGAGSDK DVAPGSAVLR QWLPAG 

AIPV PSGILMNVAK YTQLCQYLSK TTICVPHNMR VMHFGAGSDK GVAPGSTVLK QWLPEG 

SARS CoV PKGIMMNVAK YTQLCQYLNT LTLAVPYNMR VIHFGAGSDK GVAPGTAVLR QWLPTG 

I I I ! I I I I I I I I 

2585 2595 2605 2615 2625 2635 

EMCR All I DNDINDYVSD ADFSITGDCA TVYLEDKFDL LISDMYDG RIKFCDGE 

229E AIVV DNDVVDYVSD ADFSVTGDCA TVYLEDKFDL LISDMYDG — — RTKAIDGE 

PEDV AIIV DNDSVDYVSD ADYSVTGDCS TLYLSDKFDL VISDMYDG KIKSCDGE 

TGEV AILV DNDLRDYVSD ADFSVTGDCT SLYIEDKFDL LVSDLYDG STKSIDGE 

BoCoV QWLPAGTILV HNDLYPFVSD SVATYFGDCI TLPFDCQWDL IISDMYD LLLDIGVH 

OC43 TILV DNDLYPFVSD SVATYFGDCI TLPFDCQWDL IISDMYDP — — ITKNIGEY 

MHV SILV DNDINPFVSD SVASYYGNCI TLPIACQWDL IISDMYDP — — LTKNIGEY 

AIPV TLLV DNDIVDYVSD AHVSVLSDCN KYNTEHKFDL VISDMYTDND SKRKHEGVIA 

SARS Gov TLLV DSDLNDFVSD ADSTLIGDCA TVHTANKWDL IISDMYDP — — RTKHVTKE 

I I ....I I I I I I I I I I 

2645 2655 2665 2675 2685 2695 

EMCR NVSKDGFFTY LNGVIREKLA IGGSVAIKIT EYSWNKYLYE LIQRFAFWTL FCTSVNTSSS 

229E NVSKEGFFTY INGFICEKLA IGGSIAIKVT EYSWNKKLYE LVQRFSFWTM FCTSVNTSSS 

PEDV NVSKEGFFPY INGVITEKLA LGGTVAIKVT EFSWNKKLYE LIQKFEYWTM FCTSVNTSSS 

TGEV NTSKDGFFTY INGFIKEKLS LGGSVAIKIT EFSWNKDLYE LIQRFEYWTV FCTSVNTSSS 

BOCOV VVRCS YI HCHMIRDKLA LGGSVAIKIT EFSWNAELYK LMGYFAFWTV FCTNANASSS 

OC43 NVSKDGFFTY ICHMIRDKLA LGGSVAIKIT EFSWNAELYK LMGYFAFWTV FCTNANASSS 

MHV NVSKDGFFTY LCHLIRDKLA LGGSVAIKIT EFSWNAELYS LMGKFAFWTI FCTNVNASSS 

AIPV NNGNDDVFIY LSSFLRNNLA LGGSFAVKVT ETSWHEVLYD lAQDCAWWTM FCTAVNASSS 

SARS CoV NDSKEGFFTY LCGFIKQKLA LGGSIAVKIT EHSWNADLYK LMGHFSWWTA FVTNVNASSS 

....I.. ..I ....I. ...I ....I.. ..I I.. ..I I..,. I 

2705 2715 2725 2735 2745 2755 

EMCR EAFLIGINYL GDFIQGPFIA GNTVHANYIF WRNSTIMSLS YNSVLDLSKF ECKHKATVW 

229E EAFWGINYL GDFAQGPFID GNIIHANYVF WRNSTVMSLS YNSVLDLSKF NCKHKATVW 

PEDV EAFLIGVHYL GDFASGAVID GNTMHANYIF WRNSTIMTMS YNSVLDLSKF NCKHKATWV 

TGEV EGFLIGINYL GPYCDKAIVD GNIMHANYIF WRNSTIMALS HNSVLDTPKF KCRCNNALIV 

BoCoV EGFLIGINYL GK— PKVEID GNVMHAIICF G EIPQFGTGVL 

OC43 EGFLIGINYL CK— PKVEID GNVMHANYLF WRNSTVWNGG AYSLFDMAKF PLKLAGTAVI 

MHV EGFLIGINML NR— TRTEID GKTMHANYLF WRNSTMWNGG AYSLFDMSKF PLKVAGTAW 

AIPV EAFLIGVNYL GAS-EKVKVS GKTLHANYIF WRNCNYLQTS AYSIFDVAKF DLRLKATPW 

SARS CoV EAFLIGANYL GK— PKEQID GYTMHANYIF WRNTNPIQLS SYSLFDMSKF PLKLRGTAVM 

I 1 ....I I I I 1 1 

2765 2775 2785 2795 

EMCR TLKDSDVNDM VLSLIKSGRL LLRNSGRFGG FSNHLVSTK- 

22 9E QLKDSDINEM VLSLVRSGKL LVRGNGKCLS FSNHLVSTK- 

PEDV NLKDSSISDV VLGLLKNGKL LVRNNDAICG FSNHLVNVNK 

TGEV NLKEKELNEM VIGLLRKGKL LIRNNGKLLN FGNHFVNTP- 

BOCoV lACLIWLNSR LSWLVMP 

OC43 NLRAOQINDM VYSLLEKGKL LIRDTNKEVF VGDSLVNVI- 

MHV SLKPDQINDL VLSLIEKGKL LVRDTRKEVF VGDSLVNVK- 

AIPV NLKTEQKTDL VFNLIKCGKL LVRDVGNTSF TSDSFVCTM- 

SARS CoV SLKENQINDM lYSLLEKGRL IIRENNRVVV 5SDILVNN — 
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d Putative Orf lab 

— I — I — I — I — I — I — I. ...I — I — I — I — I 

5 15 25 35 45 55 

EMCR M FYNQVTLAVA SDSEISGFGF AIPSVAVRAY SEAAAQGFQA 

229E M ACNRVTLAVA SDSEISANGC STIAQAVRRY SEAASNGFRA 

PEDV M ASNHVTLAFA NDAEISAFGF CTASEAVSYY SEAAASGFMQ 

TGEV M SSKQFKILVN EDYQVNVPSL PIR-DVLQEI KYCYRNGFEG 

OV43 MSKINKYGLE LHWAPEFPWM FEDAEEKLDN PSSSEVDMIC STTAQKLETD GICPENHVMV 

BoCoV MSKINKYGLE LHWAPEFPWM FEDAEEKLDN PSSSEVDIVC STTAQKLETG GICPENHVMV 

HHV MAKMGKYGLG FKWAPEFPWM LPNASEKLGS PERSEEDGFC PSAAQEPKTK GKTLINHVRV 

AIBV MASSLKQGVS PKPRDVILVS KDIPEQLCDA LFFYTSHNPK 

SARS COV — MESLVLGV NEKTHVQLSL PVLQVRDVLV RGFGDSVEEA LSEAREHLKN 

I I I I I I I I I I I I 

65 75 85 95 105 115 

EMCR CRFVAFGLQD CVTGINDDDY VXALTG TNQLCAKILL FSDRPLNLRG 

229E CRFVSLDLQD CIVGIADDTY VMGLHG NQTLFCNIMK FSDRPFMLHG 

PEDV CRFVSLDLAD TVEGLLPEDY VMWIG TTKLSAYVDT FGSRPRNICG 

TGEV YVFVPEYCRD LVDCDRKDHY VIGVLG NGVSDLKPVL LTEPSVMLQG 

OV43 DCRRLLKQEC CVQSSLIREI VMNASPYDLE VLLQDALQSR EAVLVTTPLG MSLEACYVRG 

BoCoV DCRRLLKQEC CVQSSLIREI VMNTRPYDLE VLLQDALQSR EAVLVTPPLG MSLEACYVRG 

MHV DCSRLPALEC CVQSAIIRDI FVDEDPLNVE ASTMMALQFG SAVLVKPSKR LSIQAWAKLG 

AIBV DYADAFAVRQ KFDRSLQTGK QFKFET V CGLFLLKGVD KITPGVPAKV 

SARS CoV GTCGLVELEK GVLPQLEQPY VFIKR — SDA LSTNHGHKW ELVAEMDGIQ YGRSGITLGV 

I I I I I I I I I I I I 

125 135 145 155 165 175 

EMCR WLIFSNSNYV LQDFDWFG- -HGAGSWFV DKYMCGFDGK PVLP — KNMW EFRDYFNDNT 

229E WLVFSNSNYL LEEFDWFGK -RGGGNVTYT DQYLCGADGK PVMS — EDLW QFVDHFGENE 

PEDV WLLFSNCNYF LEELELTFG- -RRGGNIVPV DQYMCGADGK PVLQ — ESEW EYTDFFADSE 

TGEV FIVRANCNGV LEDFDLKIA- -RTGRGAIYV DQYMCGADGK PVIE— G DFKDYFGDED 

OV43 CNPKGWTMGL FRRRSVCNTG RCTVNKHVAY QLYMIDPAGV CLGAGQFVGW VIPLAFMPVQ 

BoCoV CNPNGWTMGL FRRRSVCNTG RCAVNKHVAY QLYMIDPAGV CFGAGQFVGW VIPLAFMPVQ 

MHV VLPKTPAMGL FKRFCLCNTR ECVCDAHVAF QLFTVQPDGV CLGNGRFIGW FVPVTAIPAY 

AIBV LKATSKLADL EDIFGVSPLA RKYRELLKTA CQWSLTVEAL DVR AQ TLDEIFDPTE 

SARS COV LVPHVGETPI AYRNVLLRK NGNKGAGG HSYGIDLKSY DLG— DELGT DPIEDYEQNW 

I.... I I I I I I I I I 1 I 

185 195 205 215 225 235 

EMCR DS-IVIGGVT YQLAWDVIRK DLSYEQQNVL AIESIHYLG- TTGHTLKSGC KLINAKPPKY 

22 9E E — IIINGHT YVCAWLTKRK PLDYKRQNNL AIEEIEYVHG DALHTLRNGS VLEMAKEVKT 

PEDV DGQLNIAGIT YVKAWIVERS DVSYASQNLT SIKSITYCS- TYEHTFLDGT AMKVARTPKI 

TGEV — IIEFEGEE YHCAWTTVRD EKPLNQQTLF TIQEIQYNL- DIPHKLPNCA TRHVAPPVKK 

OV43 SRKFIVPWVM YLRKRGEKGA YNKDHGRGGF GH-VYDFKVE DAYDQVHDEP KGKFSKKAYA 

BoCoV SRKFIAPWVM YLRKCGEKGA YIKDYKRGGF EH-VYNFKVE DAYDLVHDEP KGKFSKKAYA 

MHV AKQWLQPWSI LLRKGGNKGS VTSGHFRRAV TMPVYDFNVE DACEEVHLNP KGKYSRKAYA 

AIBV ILWLQVAAKI HVSSMAMRRL VGEVTAKVMD ALG SNLSALFQIV KQQIARIFQK 

SARS CoV NTKHGSGALR ELTRELNGGA VTRYVDNNFC GPDGYPLDCI KDFLARAGKS MCTLS-SQLD 

I 1 I I I I I I I I I I 

245 255 265 275 285 295 

EMCR SSKVVLSGEW NAVYKAFGSP FITNGISLLD IIVKPVFFNA FVKCNCGSEN WSVGAWDGYL 

229E SSKVVLSDAL DKLYKVFGSP VMTNGSNILE AFTKPVFISA LVQCTCGTKS WSVGDWTGFK 

PEDV KKNWLSEPL ATIYREIGSP FVDNGSDARS IIRRPVFLHA FVKCKCGSYH WTVGDWTSYV 

TGEV NSKIVLSEDY KKLYDIFGSP FMGNGDCLSK CFDTLHFIAA TLRCPCGSES SGVGDWT6FK 

OV43 LIRGYRGVKP LLYVDQYGCD YTGSLADGLE AYADKTLQEM KALFPTWSQE LLFDVIVAWH 

BoCoV LIRGYRGVKP LLYVDQYGCD YTGGLADGLE AYADKTLQEM KALFPIWSQE LPFDVTVAWH 

MHV LLKGYRGVKS ILFLDQYGCD YTGRLAKGLE DYGDCTLEEM KELFPVWCDS LDNEVVVAWH 

AIBV ALAIFENVNE LPQRIAALKM AFAKCARSIT VVVVERTLVV KEFAGTCLAS INGAVAKFFE 

SARS CoV YIESKRGVYC CRDHEHEIAW FTERSDKSYE HQTPFEIKSA KKFDTFKGEC PKFVFPLNSK 

I 1 I I I I I I I I I I 

305 315 325 335 345 355 

EMCR SSCCGTPAKK LCWPGNVVP GDVIITSTDA GCGVKYYAGL VVKHITNITG VSLWRVTAVH 

229E SSCCNVISNK LCVVPGNVKP GDAVITTQQA GAGIKYFCGM TLKFVANIEG VSVWRVIALQ 

PEDV STCCGFKCKP VLVASCSAMP GSVWTRAGA GTGVKYYNNM FLRHVADIDG LAFWRILKVQ 

TGEV TACCGLSGKV KGVTLGDIKP GDAVVTSMSA GKGVKFFANC VLQYAGDVEG VSIWKVIKTF 

0V43 VVRDPRYVMR LQSAATIRSV AYVANPTEDL CDGSWIKEP VHVYADDSII LRQYNLVDIM 

BoCoV VVRDPRYVMR LQSASTIRSV AYVANPTEDL CDGSWIKEP VHVYADDSII LRQHNLVDIM 

MHV VDRDPRAVMR LQTLATIRSI GYVGQPTEDL VDGDVWREP AHLLAANAIV KRLPRLVETM 

AIBV ELPNGFMGSK IFTTLAFFKE AAVRWENIP NAPRGTKGFE WGNAKGTQV VVRGMRNDLT 

SARS CoV VKVIQPRVEK KKTEGFMGRI RSVYPVASPQ ECNNMHLSTL MKCNHCDEVS WQTCDFLKAT 
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I I I I I I I 1 I I I I 

365 375 385 395 405 415 

EMCR SDGMFVATSS YDALLHRNSL DPFCFDVNTL LSNQLEUAFL GASVTEDVKF A AST 

22 9E SVDCFVASST FVEEEHVNRM DTFCFNVRNS VTDECRLAML GAEMTSNVRR Q VAS 

PEDV SKDDLACSGK FLEHHEEGFT DPCYFLNDSS LATKLKFDIL SGKFSDEVKQ A IIA 

TGEV TVDETVCTPG FEGELN DFIKPESKSL VACSVKRAFI TGDIDDAVHD C I IT 

OV43 SHFYMEADTV VNAFYGVALK DCGFVMQFGY IDCEQDSCDF KGWIPGNMID G FACTTC 

BoCoV SCFYMEADAV VNAFYGVDLK DCGFVMQFGY IDCEQDLCDF KGWVPGNMID G FACTTC 

MHV LYT DSSV TEFCYKTKLC DCGFITQFGY VDCCGDACDF RGWVPGNMMD G FLCPGC 

AIBV LLDQKADIPV EPEGWS — AILDGHLC YVFRSGDRFY AAPLSGNFAL S 

SARS Gov CEHCGTENLV lEGPTTCGYL PTNAVVKMPC PACQDPBIGP EHSVADYHNH SNIETRLRKG 

1 I I I I I I I I I I I 

425 435 445 455 465 475 

EMCR GVIDISAGMF GLYDDILTNN KPWFVRKASG LFDAIWDAFV AAIKLVPTTT GGLVRFVKSI 

22 9E GVIDISTGWF DVYDDIFAES KPWFVRKAED IFGPCWSALA SALKQLKVTT GELVRFVKSI 

PEDV GHVVVGSALV DIVDDALG — QPWFIRKLGD LASAPWEQLK AWRGLGLLS DEVVLFGKRL 

TGEV GKLDLSTNLF GNVGLLFKK- TPWFVQKCGA LFVDAWKVVE ELCGSLTLTY KQIYEVVASL 

OV43 GHVYEVGDLI AQSSGVLPVN PVLHTKSAAG YGGFGCKDSF TLYGQTWYF GGCVYWSPAR 

BoCoV GHVYETGDLL AQSSGVLPVN PVLHTKSAAG YGGFGCKDSF TLYGQTWYF GGCVYWSPAR 

MHV SKSYMPWELE AQSSGVIPKG GVLFTQSTDT VN RESF KLYGHAVVPF GSAVYWSPYP 

AIBV -DVHCCERVV CLSDGVTP — — EIN— DGL ILAAIYSSFS VSELVTALKK GEPFKFLGHK 
SARS COV GRTRCFGGCV FAYVGCYNKR AYWVPRASAD IG SGHT GITGDNVETL NEDLLEILSR 

I I I I I 1 I I I I I I 

485 495 505 515 525 535 

EMCR ASTVLTVSNG VIIMCADVPD AFQPVYRTFT QAICAAFDFS LDVFKIG 

22 9E CNSAVAWGG TIQILASVPE KFLNAFDVFV TAIQTVFDCA VETCTIA 

PEDV SCATLSIVNG VFEFLADVPE KLAAAVTVFV NFLNEFFESA CDCLKVG 

TGEV CTSAFTIVNY KPTFVVPD-N RVKDLVDKCV KVLVKAFDVF TQIITIAG — 

OV43 NIWIPILKSS VKSYDSLVYT GVLGCKAIVK ETNLICKALY LDYVQHKCGN LHQRELLGVS 

BoCoV NIWIPILKSS VKSYDGLVYT GVVGCKAIVK ETNLICKALY LDYVQHKCGN LHQRELLGVS 

MHV GMWLPVIWSS VKSYADLTYT GWGCKAIVQ ETDAICRSLY MDYVQHKCGN LEQRAILGLD 

AIBV FVYAKDA AVSFTLAKAA TIADVLRLFQ SARVIAEDVW SSFTBKS 

SARS CoV ERVNINI VGDFHLNEEV AIILASFSAS TSAFIDTIKS LDYKSFKT-I VESCGNYKVT 

I I I I I I 1 I I I I I 

545 555 565 575 585 595 

EMCR DVKFKR LGDYVLTENA LVRLTTEVVR GVRD A 

229E GKAFDK VFDYVLLDNA LVKLVTTKLK GVRE R 

PEDV GKTFNK VGSYVLFDNA LVICLVKAKAR GPRQ A- 

TGEV — lEAKCFVL GAKYLLFNNA LVKLVSVKIL GKKQ K 

OV43 DVWHKQLLLN RGVYKPLLEN IDYFNMRRAK FSLETFT VCADGFMPFL LDDLVPRAYY 

BoCoV DVWHKQLLLN RGVYKPLLEN IDYFNMRRAK FSLETFT VCADGFMPFL LDDLVPRAYY 

MHV DVYHRQLLVN RGDYSLLLEN VDLFVKRRAE FACK-FA TCGDGLVPLL LDGLVPRSYY 

AIBV FEFWKLAYGK VRNLEEFVKT YVCK 

SARS CoV KGKPVKGAWN IGQQRSVLTP LCGFPSQAAG VIRSIFARTL DAANHSIPDL QRAAVTILDG 

....I I I I 1 I I.. ..I 1 I I I 

605 615 625 635 645 655 

EMCR -RIKKAMFTK VWGPTTEVK FSVIELATVN LRLVDCAPW CPKGKIWIA GQAFFYSGGF 

22 9E -GLNKVKYAT WVGSTEEVK SSRVERSTAV LTIANNYSKL FDEGYTWIG DVAYFVSDGY 

PEDV -GICEVRYTS LVVGSTTKVV SKRVENANVN LVWDEDVTL NTTGRTWVD GLAFFESDGF 

TGEV -GLECAFFAT SLVGATVNVT PKRTETATIS LNKVDDWAP G-EGYIVIVG DMAFYKSGEY 

OV43 LAVSGQAFCD YADKLCHAVV SKSKELLDVS LDSLGAAIHY LNSKIVDLAQ HFSDFGTSFV 

BoCoV LAVSGQAFCD YAGKICHAVV SKSKELLDVS VDSLGAAIHY LNSKIVDLAQ HFSDFGTSFV 

MHV LIKSGQAFTS MMVNFSHEVT DMCMDMALLF MHDVKVATKY VKKVTGKLAV RFJCALGVAW 

AIBV AQMSIVILAA VLGEDIWHLV SQVIYKLGVL FTKVVDFCDK HWKGFCVQLK RAKLIVTBTF 

SARS CoV ISEQSLRLVD AMVYTSDLLT NSVIIHAYVT GGLVQQTSQW LSNLLGTTVE KLRPIFEWIE 

I 1 I I I I I 1 I I I I 

665 675 685 695 705 715 

EMCR YRFMVDSTTV LNDPVFTGEL FYTIKFSGFK LDGFN HQFVNAS SATDAIIAVE 

22 9E FRLMASPNSV LTTAVYKPLF AFNVNVMGTR PE KFPTTV TCENLESAVL 

PEDV YRHLADADW lEHPVYKSAC ELKPVFECDP IP— D FPLPVAA SVAELCVQTD 

TGEV YFMMSSPNFV LTNNVFKAVK VPSYDIVYDV DNDTKSKMIA KLGSSFEYDG DIDAAIVKVN 

OV43 SKIVHFFKTF TTSTALAFAW VLFHVLHGAY IWESDIYFV KN-IPRYASA VAQAFQSVAK 

BoCoV SKIVHFFKTF TTSTALAFAW VLFHVLHGAY IVVESDIYFG KN-IPRYASA VAQAFRSGAK 

MHV RKITEWFDLA VDTAASAAGW LCYQLVNGLF AVANGGITFL SD-VPBLVKN FVDKFKVFFK 

AIBV CVLKGVAQHC FQLLLDAIHS LYKSFKKCAL GRIHG DLLFWKGG VHKIVQDGDE 

SARS COV AKLSAGVEFL KD AWE ILKFLITGVF DIVKGQIQVA SDNIKDCVKC FIDVVNKALE 

I I I I I I I I I I I I 

725 735 745 755 765 775 

EMCR LLLSDFKTAV FVYTCVVDGC SVIVRRDAT- FATHVCFKDC YSIWEQFCID NCGE 

229E FVNDKITEFQ LDYSIDVIDN EIIVKPNIS- LCVPLYVRDY VDKWDDFCRQ YSNE 

PEDV LLLKNYNTPY KTYSCVVRGD KCCITCTLQ- FKAPSYVEDA VN-FVDLCTK NIGT 

TGEV ELLIEFRQQS LCFRAFKDDK SIFVEAYFKK YKMPACLAKH IG-LWNIIKK DSCK 

OV43 VVLDSLRVTF IDGLSCFKIG RRRICLSGRK lYEVERG-LL HSSQLPLDVY DLTMPSQVQK 

BoCoV VGLDSLRVTF IDGLSCFKIG RRRICLSGSK lYEVERG-LL HSSQLPLDVY DLTMPSQVQK 

MHV VLIDSMSVSV LSGLTWKTA SNRVCLAGCK VYEWQK-RL SAYVMPVGCN EATC 

AIBV IWFDAIDSVD VEDLGVVQEK SIDFEVCDDV TLPENQPGHM VQIEDDGKNY MFFR 

SARS CoV MCIDQVTIAG AK-LRSLNLG EVFIAQSKGL YRQCIRGKEQ LQLLMPLKAP KEVT 
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EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



785 



1 



..I I 

795 

PH FLTDYNAILQ 

SW FEDDYRAFIS 

AG FHEFYITAHE 

RG FLNLFNHLNE 

AKQKPIYLKG SGSOFSLADS 
TKQKGIYLKG SGSOFSLADS 

LVG EIEPAVVEDD 

FKK DENIYYTPMS 

FLEG DSHDTVLTSE 



805 
SNNPQCAIVQ 
VLDITOAAVK 
QQDLQGFLTT 
LEDIKETNIQ 
VVEVVTTSLT 
VVEWTTSLT 
VVDWKAPLT 
QLGAINWCK 
EWLKNGELE 



....I. ...I I 
815 825 

ASBSK VLLERFLP 

AAESK AFVDTIVP 

CCTMSG F-ECFMPTIP 

AIKN 1- 

PCG YS EPPKVADKIC 

PCG YS EPPKVADKIC 

YQG CC KPPTSFEKIC 

AGG KTVT 

ALETPVDSFT NGAIVGTPVC 



I I 

835 
KCPEILLSID 
PCPSILKVID 
QCPAVLBEID 
LCPDPLLDLD 
IVDNVYMAKA 
IVDNVYMAKA 
VVDKLYMAKC 
FGETTVQEIP 
VNGLMLLEIK 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



I I 

845 
DGHLWNLFVE 
GGKIWNGVIK 
GGSIWRSFIT 
YGAIWYNCMP 
GDKYYPVWD 
GDKYYPVVVD 
GDQFYPVWD 
PPDVVPIKVS 
DKEQYCALSP 



I I 

855 

K 

N 

G 

-DHVGLLDQA 
-GHVGLLOQA 
NDTIGVLDQC 

— GLIATNNV 



I I 

865 
-FNFVTDWLK 
-VNSVRDWLK 
-LNTMWDFCK 
-CSDP-SVLG 
WRVPCAG--R 
WRVPCAG— R 
WRFPCAG— K 
— lECCG — E 
FRLKGGAPIK 



I 



875 
TLKLTLTSNG 
SLKLNLTQQG 
RLKVSFGLDG 
SVQLLIGNG- 
RVTFKEQPTV 
CVTFKEQPTV 
KVEFMDKPKV 
PWNTIFKKAY 
GVTFG-EDTV 



I } 

885 
LLGNCAKRFR 
LLGTCAKRFK 
IVVTVARKFK 
-VKWCDGCK 
KEIISMPKII 
NEIASTPKTI 
KEIPST-RKI 
KEPIEVDTDL 
WEVQGY-KNV 



I I 

895 
RVLVKLLDVY 
RWLGILLEAY 
RLGALLAEMY 
GFANQLSKGY 
KVFYELDNDF 
KVFYELDKDF 
KINFALDATF 
TVEQLLSVIY 
RITFELDERV 



I 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



905 
NGFLETVCSV 
NAFLDTVVST 
NTYLSTWEN 
NKLCNAARND 
NTILNTACGV 
NTILNTACGE 
DSVLSKACSE 
EKMCDDLKLF 
DBCVLNBKCSV 



1 1 

915 
VHTAGVCIKY 
VKIGGLTFKT 
LVLAGVSFKY 
lEIGGIPFST 
FEVDDTVDME 
FEVDDTVDME 
FEVDKDVTLD 
PEAPEPPPFE 
YTVESGTEVT 



I I 

925 
YAVNVP-YVV 
YAFDKP-YIV 
YATSVP-KIV 
FKTPTNTFIE 
BFYAWIDAI 
EFYAWIDAI 
ELLDWLDAV 
NVALVDKMGK 
EFACWAEAV 



I I 

935 
ISGFVSRVIR 
IRDIVCKVEN 
LGGCFHSVKS 
MTDAIYSVIE 
EEKLSPCKEL 
EEKLSPCKEL 
ESTLSPCKEH 
DLDCIKSCHL 
VKTLQPVSDL 



I I 

945 
RERCD — VTF 
KTEAEWIELF 
VFASV— FQI 

QGKALS 

EGVGAKVSAF 
EGVGAKVSAF 
DVIGTKVCAL 

lYR 

LTN MGID 



I I 

955 
PCVSCVTFFY 
PHNDRIKSFS 
PVQAGIEKFK 



LQKLEDNPLF 
LQKLEDNSLF 
LNRIiAEDYVY 



LDEWSVATFY 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



I I 

965 
EFLDTCFGVS 
TFESAYMPIA 
VFLNCVHPVV 
-FRDADVPVV 
LFDEAGEEVL 
LFDEAGEEVL 
LFDEGGEEVI 
-DYESDDDIE 
LFDDAGEENF 



I I 

975 

K PNAID 

D PTHFD 

PRVIE 

DNGTISTADW 
APKLYCAFTA 
APKLYCAFTA 
APKMYCSFSA 

E ED- 

SSRMYCSFYP 



I I 

985 
VEHLELKBTV 
lEEVELLDAE 
TSFVELEETT 
SEPILLBPAE 
P — EDDDFLE 
P — EDDDFLE 
P — DDEDCVA 
AEECDTDSGE 
PDEEEEDDAE 



I I 

995 
FVEPKDGGQF 
FVEPGCGGIL 
FKPPALNGGI 
YVKPKNNGNV 
ESDVEEDDVE 
ESGVEEDDVE 
ADVVDADENQ 
AEECDTNSEC 
CEEEEIDETC 



I I 

1005 
FVSDDYLWYV 
AVIDEHVFYK 
AIVDGFAFYY 
IVIAGYTFYK 
GEETDLTVTS 
GEETDLTVTS 
GDDADDSAAL 
EEEDEDTKVL 
EHEYGTEDDY 



I I 

1015 
V-DDIYYPAS 
K-DGVYYPSN 
D-GTLYYPTD 
DEDEHFYPYG 
AGQPCVASEQ 
AGEPCVASEQ 
VTDTQEEDGV 
ALIQDPASIK 
QGLPLEFGAS 



I. 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



1025 
CNGVLPVAFT 
GTNILPVAFT 
GNSWPICFK 
FGKIVQRMYN 
EESSEVLEDT 
EESSEILEDT 
AKGQVGVAES 
YPLPLDEDYS 
AETVRVEEEE 



I I 

1035 
KLAGGK— IS 
KAAGGK — VS 
KKGGGD— VK 
KMGGGDKTVS 
LDDGPSVETS 
LDDGPCVETS 
OARLDQVEAF 
VYNGCIVHKD 
EEDWLDDTTE 



I I 

1045 
FSDDVIVHDV 
FSDDVEVKDI 
FSDEVSVKTI 
FSEEVDVQEI 
DSQVEEDVEM 
DSQVEEDVQM 
DIEKVEDPIL 
ALDWNLPSG 
QSEIEPEPEP 



I 



1055 
EPTHKVKLIF 
EPVYRVKLCF 
DPVYKVSLEF 
APVTRVKLEF 
SDFVDLESVI 
SDFGDLESVI 
NELSABLNAP 
BETFVVNNCF 
TPEBPVNQFT 



. . I I 

1065 



BFB 

EFE 

EFE 

EFD 

QD 

QD 

ADKTYEOVLA FDAIYSEALS 

EG AVK 

GYLK LTDNVAI 



. . I I 

1075 

DDWT 

DEKLV 

SETIM 

---NEIVT 
— YENVCF 
— YENVCF 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



1085 
SLCKK— 
DVCEK— 
AVLNK— 
GVLER— 



1095 



S 

A 

• A 

A 

EFYTT— EPE FVKVLGLYVP 
EFYTT — EPE FVKVLDLYVP 
AFYAVPGDET HFKVCGFYSP 

PLPQK 

KCVDIVKEAQ SANPMVIVHA 



I I I I I I 1 I 

1105 1115 1125 1135 

FGKSIIYTG- DWEGLHEVLT SAMNVIG — QHIKLPQF 

IGKKIKHEG- DWDSFCKTIQ SALSVVS — CYVNLPTY 

VGNRIKVTG- GWDDVVEYIN VAIEVLK DHVEVPKY 

IGTRYKFTGT THEEFEESIS EBLOAIFDTI. ANQGVELEGY 
K— ATRNNCW LRSVLAVMQK LPCQFKDKNL QDLWVLYKQQ 
K — ATRNNCW LRSVLAVMQK LPCQFKDKNL QDLWVLYKQQ 
A — lERTNCW LRSTLIVMQS LPLEFKDLEM QKLWLSYKSS 

WDVLG DWGEAVDAQE QLCQQEP LQHTFE 

ANIHLKHGGG VAGALNKATN GAMQKESDDY IKLNGPLTVG 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



I I 

1145 
YIYDEEGGYD 
YIYDEEGGND 
YIYDBEGGTD 
FIYDTCGGFD 
YSQLFVDTLV 
YSQLFVDTLV 
YNKEFVDKLV 
EPVENSTGSS 
GSCLLSGHNL 



I I 

1155 
VSKP— VMIS 
LSLP—VMIS 
PNLP— VMVS 
IKNPD6IMIS 
NKIPANIVLP 
NKIPANIVVP 
KSVPKSIILP 
KTMTEQVWE 
AKKCLHVVGP 



I I 

1165 
QWPISDDSDG 
EWPLSVQQAQ 
QWPLNDDTIS 
QYDINITADB 
QGGYVADFAY 
QGGYVADFAY 
QGGYVADFAY 
DQELPVVEQD 
NLNAGEDIQL 



I I 

1175 
CVVEASTDFH 
QEATLPDIAE 
QDLLDVEVVT 
KSEVSASSEB 
WFLTLCDWQC 
WFLTLCDWQC 
FFLSQCSFKA 
QDWVYTPTD 
LKAAYENFNS 



I I 

1185 
Q — LESVREE 
D — VVDQVEE 
DAPIDSEGDE 
EE'VESVEED 
VAYWKCIKCD 
VAYWKCIKCD 
YANWRCLKCD 
LEVAKETAEE 
QDILLAPLLS 



I I 

1195 

VD 

VNS 

VDSSAPEKVA 
PENEIVEASB 

LALK LKG 

LALK LKG 

MDLK LQG 

VD 

AGIFG — AKP 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



62/87 



PCT/NL2004/000805 



EMCR 

229G 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



1205 



I ....I. ...I 
1215 



1225 



HE 

IFD 

D VANSEPGDDG LPVAPETNVE 

GAEGTSSQEB VBTVEVADIT STEEDVDXVE 
LDAMFFYGDV VSHICKCGES MVLIDVDVPF 
LDAMFFYGDV VSHVCKCGES MVLIDVDVPF 
LDAMFFYGDV VSHVCKCGTG MTLLSADIPY 

EFIL 

LQSLQVCVQT VRTQVYIAVN DKALYEQVVM 



1235 
QPFGEVEHAL 
lETVDVKHDV 
SEVEEVAATL 
VSAKDDPWAA 
TAHFALKDKL 
TAHFALKDKL 
TLHFGLRDDK 
IFAVPKEEVV 
DYLDNLKPRV 



1245 1255 

SIRQ PFSFSFR 

S PFEMPFE 

SFIKDTPSTV TKDPFAFDFV 
AVDVQEAEQF NPSLPPFKTT 
FCAFITKRIV YKAACWDVN 
FCAFITKRSV YKAACWDVN 
FCAFYTPRKV FRAACVVDVN 

S QKDGA 

EAPKQEEPPN TEDSKTEEKS 



I I I I I I I I I I I I 

1265 1275 1285 1295 1305 1315 

EMCR DELGVRVLDQ SDNNCWISTT LIQLQLTKLL DDSIEMQLFK VGKVDSIVQK CYELSHLISG 

229E ELNGLKILKQ LDNNCWVNSV MLQIQLTGIL DGDYAMQFFK MGRVAKMIER CYTAEQCIRG 

PEDV SYGGLKVLRQ SHNNCWVTST LVQLQLLGIV D-DPAMELFS AGRVGPMVRK CYESQKAILG 

TGEV NLNGKIILKQ GDNNCWINAC CYQLQAFDFF N-NEAWEKFK KGDVMDFVNL CYAATTLARG 

OV43 DSHSMAWDG KQIDDHRITS ITSDKFDFII G-HGMSFSMT TFEIAQLYGS CITP-NVCFV 

BoCoV DSHSMAWDG KQIDDHRITS ITSDKFDFII G-HGTSFSMT TFEIAQLYGS CITP-NVCFV 

MHV DCHSMAWDG KQIDGKWTK FNGDKYDFMV G-H(»IAFSMS AFEIAQLYGS CITP-NVCFV 

AIBV QIKQEPIQVV KPQREKKAKK FKVKPATCEK P K- FLEYKTCVGD LTVVIAKALD 

SARS CoV VVQKPVDVKP KIKACIDEVT TTLEETKFLT N-KLLLFADI NGKLYHDSQN MLRGEDMSFL 



I I I I I I I I I 1 I I 

1325 1335 1345 1355 1365 1375 

EMCR SLGDSGKLLS ELLKDKYTCS ITFEMSCDCG KKFDEQVGCL FWIMPYTKLF QKGECCICHK 

22 9E AMGDVGLCMY RLLKDLHTGF MVMDYKCSCT SGRLEESGAV LFCTPTKKAF PYGTCLNCNA 

PEDV SLGDVSACLE SLTKDLHTLK ITCSWCGCG TGERIYEGCA FRMTPTLEPF PYGACAQCAQ 

TGEV HSGDAEYLLE LMLNDYSTAK IVLAAKCGCG EKEIVLERAV FKLTPLKBSF NYGVCGDCMQ 

OV43 KGDIIKVSKL VtU^EWVNPA NGHMAHGGGV AKAIAVAAGQ QFVKETTDMV KSKGVCATGD 

BoCoV KGDIIKVSKR VKAEVVVNPA NGHMAHGGGV AKAIAVAAGQ QFVKETTDMV KSKGVCATGD 

MHV KGDVIKVLRR VGAEVIVNPA NGRMAHGAGV AGAIAKAAGK SFIKETADMV RNQGVCQVGE 

AIBV EFKEFCIVNA ANEHMTHGSG VAKAIADFCG LDFVEYCEDY VKKHGPQQRL VTPSFVKGIQ 

SARS CoV EKDAPYMVGD VITSGDITCV VIPSKKAGGT TEMLSRALKK VPVDEYITTY PGQGCAGYTL 



I I I I I I I I I I I I 

1385 1395 1405 1415 1425 1435 

EMCR MQTYKLVSMK GTGVFVQD — PAPIDIDAFP VRPICSSVYL GVKGSGHYQT NLYSFDKAID 

229E PRMCTIRQLQ GTIIFVQQK- PEPVNPVSFV VKPVCSSIFR GAVSCGHYQT NIYSQNLCVD 

PEDV VLMHTFKSIV GTGIFCRD — TTALSLDSLV VKPLCAAAFI GK-DSGHYVT NFYDAAMAID 

TGEV VNTCRFLSVE GSGVFVHDIL SKQTPEAMFV VKPVMHAVYT GTTQNGHYMV DDIEHGYCVD 

OV43 CYVSTGGKLC KTVLNVVGPD ARTQGKQSYV LLERVYKHLN NYDCWTTLI SAGIFSVPSD 

BoCoV CYVSTGGKLC KTVLNVVGPD ARTQGKQSYA LLERVYKHLN KYDCVVTTLI SAGIFSVPSD 

MHV CYESTGGNLC KTVLNIVGPD ARGHGKQCYS FLERAYQHIN KCDDVVTTLI SAGIFSVPTD 

AIBV CVNNVVGPRH GDNNLHEKLV AAYKNVLVDG WNYWPVLS LGIFGVDFKM SIDAMREAFE 

SARS CoV EEAKTALKKC KSAFYVLPSE APNAKEEILG TVSWNLREML AHAEETRKLM PICMDVRAIM 



EMCR 

229E 

PEDV 

TGEV 

OV4 3 

BoCcV 

MHV 

AIBV 

SARS CoV 



I ( 

1445 
GFGVFDIK — 
GFGVNKIQP- 
GYGRHQIK — 
GMGIKPLKKR 
VSLTYLLGTA 
VSLTYLLGTA 
VSLTYLIGW 
GCTIRVLLFS 
ATIQRKYKGI 



. 1 



1455 



.1 I 

1465 

NSSV 

WTNDAL 

YDTL 

CYTSTLFINA NVMTRAEKPK 
KKQWLVSNN QEDFDLISKC 
KKQVVLVSNN QEDFDLISKC 
TKNVILVSNN KDDFDVIEKC 

LSQE 

KIQEGIVDYG VRFFFYTSKB 



.1 



1475 
NTVCFVDVDF 
NTICIKDADY 
NTICVKDVNW 
QEFKVEKVEQ 
QITAVEG-TK 
QITAVEG-TK 
QVTSIAG-TK 
HIDYFDVTCK 
PVASIITKLN 



I I 

1485 
HS-VEIEAGE 
NAKVEISVTP 
TAPLVPAVDS 
QPIVEENKSS 
KLAARLSFNV 
KLAERLSFNV 
ALSLQLAKNL 
QKTIYLTEDG 
SLNEPLVTMP 



I I 

1495 

VK 

IKNTVDTtPK 

WEP 

lEKEEIQSPK 
GRSIVYETDA 
GRSIVYETDA 
CRDVKFETNA 

VKYR 

IGYVTHGFNL 



I I 1 I I I I I I I I I 

1505 1515 1525 1535 1545 1555 

EMCR PFAVYKNVKF YLGDISHLVN CVSFDFWNA ANENLMHGGG VARAIDILTE 

22 9E EEFWKEKLN AFLVHDNVAF YQGDVDTVVN GVDFDFIVNA ANENLAHGGG LAKALDVYTK 

PEDV VVK PFYSYKNVDF YQGDFSDLVK -LPCDFWNA ANEKLSHGGG lAKAIDVYTK 

TGEV ND DLIL PFYKAGKLSF YQGALDVLIN FLEPDVIVNA ANGDLKHHGG VARAIDVFTG 

OV43 NKLILIN DVAFVSTFNV LQDVLSLRHD lALDDDARTF VQSNVDWPE GWRVVNKFYQ 

BoCoV NKLILSN DVAFVSTFNV LQDVLSLRHD lALDDDARTF VQSNVDWPE GWRVVNKFYQ 

MHV CDSLFS DSCFVSSYDV LQEVELLRHD IQLDDDARVF VQAHMDNLPA DWRLVNKFDS 

AIBV SIVLKPG DSLGQFGQVY AKNKIVFTAD DVEDKEILYV 

SARS CoV EEAARCMR — SLKAPAWSV SSPDAVTTYN GYLTSSSKTS EEHFVETVSL AGSYRDWSYS 



I I I I I I I I I I I I 

1565 1575 1585 1595 1605 1615 

EMCR GQLQSLSKDY ISSNGPLKVG AGVMLE— CE KFN— VFNW GPRTG KHEHSLLVEA 

229E GKLQRLSKEH IGLAGKVKVG TGVMVB— CD SLR--IFNVV GPRKG KHERDLLIKA 

PEDV GMLQKCSNDY IKAHGPIKVG RGVMLE— AL GLK — VFNW GPRKG KHAPELLVKA 

TGEV GKLTERSKDY LKKNKSIAPG NAVFFENVIE HLS~VLNAV GPRNGD SRVEAKLCNV 

OV43 INGVRT-VKY FECTGGIDIC SQDKVFGYVQ QGIFNKATVA QIKALF LDKVDILLTV 

BoCoV INGVRP-VKY FECPGGIDIC SQDKVFGYVQ QGSFNKATVA QIKALF LDKVDILLTV 

MHV VDGVRT-VKY FECPGEIFVS SQGKKFGYVQ NGSFKVASVS QIRALL ANKVDVLCTV 

AIBV PTTDKSILEY YGLDAQKYVI YLQTLAQKWN VQYRDNFLIL EWRDGN — CW ISSAIVLLQA 

SARS CoV GQRTELGVEF LKRGDKIVYH TLESPVEFHL DG — EVLSLD KLKSLLSLRE VKTIKVFTTV 
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I 1 I I I I I I 1 I I I 

1625 1635 1645 1655 1665 1675 

EMCR yNSILF£NGI PLMPLLSCGI FGVRIENSLK ALFSCOINKP LQVFVYSSNE EQAVLKFLDG 

229E YNTINNEQGT PLTPILSCGI FGIKLETSLE VLLDVCNTKE VKVFVYTDTE VCKVKDFVSG 

PEOV YKSVFANSGV ALTPLISVGZ FSVPLEBSLS AFLACVGDRH CKCFCYGOKE RBAIIKYMDG 

TGEV YKAIAKCEGK ILTPLISVGI FNVRLETSLQ CLLKTVNDRG LNVFVYTDQE RQTIENFPS- 

OV43 DGVNFTNRFV PVGESFGKSL GNVFCDGVNV TKHKCDINYK GKVFFQFDNL SSEDLKAVRS 

BoCoV DGVNFTNRFV PVGESFGKSL GNVFCDGVNV TKHKCDINYK GKVFFQFDNL SSEDLKAVRS 

MHV DGVNFRSCCV AEGEVFGKTL GSVFCDGINV TKVRCSAIHK GKVFFQYSGL SAADLVAVTD 

AIBV AKIRFKGFLT EAWAKLLGGD PTDFVAWCYA SCTAKVGDFS DANWLLANLA EHFDADYTNA 

SARS COV DNTNLHTQLV DMSMTYGQQF GPTYLDGADV TKIKPHVNHE GKTFFVLPSD DTLRSEAFEY 

I I I I I I I I I I I 1 

1685 1695 1705 1715 1725 1735 

EMCR LDLTPVID DVDV V KPFRVEGN FSFFDCG VNALDGD-IY 

229E LVNVQKVE QPKI EPKPVSVIKV APKPYRVDGK FSYFTED LLCVADDKPI 

PEDV LVDAIFKEAL VDTTPVQEDV QQVSQKPVLP NFEPFRIEGA HAFYECNPEG LMSLGAD-KL 

TGEV 

OV43 SFNFDQKELL AYYNMLVN — CFKWQVVVNG KYFTFKQANN NCFVNVSCLM LQSLHLTFKI 

BoCoV SFNFDQKELL AYYNMLVN — CSKWQVVFNG KYFTFKQANN NCFVNVSCLM LQSLNLKFKI 

MHV AFGFDEPQLL KYYNMLG HCKWPVWCG NYFAFKQSNN NCYINVACLM LQHLSLKFHK 

AIBV FLKKRVSCN 

SARS CoV YHTLDESFLG RYMSALNH— TKKWKFPQVG GLTSIKWADN NCYLSSVLLA LQQLEVKFNA 

I I I 1 I I I I 1: I I 1 

1745 1755 1765 1775 1785 1795 

EMCR LLFTNSILML DKQGQL LDTKLNGILQ QAVLDYLATV KTVPAGNLVK LWE-SCTIY 

229E VLFTDSMLTL DDRGLA LDNALSGVLS AAIKDCVDIN KAIPSGNLIK FDIG-SVVVY 

PEDV VLFTNSNLDF CSVGKC LNDVTSGALL EAINVFKKSN KTVPAGNCVT LDCANMISIT 

TGEV 

OV43 VQWQEAWLEF RSGRPARFVA LVLAKGGFKF GDPADSRDFL RWFSQVDLT GAICDFEIAC 

BoCoV VQWQEAWLEF RSGRPARFVS LVLAKGGFKF GDPADSRDFL RWFSQVDLT GAICDFEIAC 

MHV WQWQEAWNBF RSGKPLRFVS LVLAKGSFKF NEPSDSTDFM RWLREADLS GATCDFEFVC 

AIBV 

SARS CoV PALQEAYYRA RAGDAANFCA LILAYSNKTV GELGDVRETM THLLQHANLE SAKRVLNWC 

I I I 1 I I I I I I I I 

1805 1815 1825 1835 1845 1855 

EMCR M-CWPSIND LSFDKNLGRC VRKLNRLKTC VIANVPAIDV LKKLLSSLTL TVKFWESNV 

229E M-CWPSEKD KHLDNNVQRC TRKLNRLMCD IVCTIPADYI LPLVLSSLTC NVSFVGELKA 

PEDV M-VVLPFDGD ANYDKNYARA WKVSKLKGK LVLAVDDATL YSKLS~HLS VLGFVSTPDD 

TGEV — CSIP 

OV43 K-CGVKQEQR TGLDAVMHFG TLSREDLEIG YTVDCSCG KKLIHCVRF DVPFLICSNT 

BoCoV K-CGVKQEQR TGVDAVMHFG TLSREDLEIG YTVDCSCG KKLIHCVRF DVPFLICSNT 

MHV K-CGVKQEQR KGVDAVMHFG TLDKGDLAKG YTIACTCG — -NKLVHCTQL NVPFLICSNK 

AIBV — CGIKSYEL RGLEACIQP- V RATN 

SARS CoV KHCGQKTTTL TGVEAVMYMG TLSYDNLKTG VSIPCVCGR- -DATQYLVQQ ESSFVMMSAP 

I I I I I I I I I I I I 

1865 1875 1885 1895 1905 1915 

EMCR MDVNDCFKND NWLKITEDG INVKDVWES SKSLGKQLG- WSDGVDSFE GVLP — INTD 

229E AEA K VITIKVTEDG VNVHDVTVTT DKSFEQQVG- VIADKDKDLS GAVPSDLNTS 

PEDV VER— FYANK SWIKVTEDT RSVKAVKVES TATYGQQIG- PCLVNDTVVT DNKP— WAD 

TGEV — VN VTEDN VNHERVSVSF DKTYGEQLKG TWIKDKDVT NQLPSAFDVG 

OV43 PASVKLPKG- VGSANIFIG^ DKVGHYVHVK CEQSYQLYDA SNVKKVTDVT GKLSDCLYLK 

BoCoV PASVKLPKG- VGSANIFKG- DKVGHYVHVK CEQSYQLYDA SNVKKVTDVT GNLSDCLYLK 

MHV PEGKKLPDD- VVAANIFTG- GSLGHYTHVK CKPKYQLYDA CNVSKVSEAK GNFTDCLYLK 

AIBV LLHFK TQYSNCPTCG ANNTDEVIEA SLPYLLLFAT DGPATVDCDE DAVG 

SARS GOV PAEYKLQQGT FLCANEYTGN YQCGHYTHIT AKETLYRIDG AHLTKMSEYK GPVTDVFY-K 

I I I I I I I I I I I I 

1925 1935 1945 1955 1965 1975 

EMCR TVLSVAPEVD WVAFYGFEKA ALFASLDVKP YGYPNDFVGG FRVLGTTDNN CWVNATCIIL 

22 9E ELLTKAIDVD WVEFYGFKDA VTFATVDHSA FAYESAVVNG IRVLKTSDNN CWVNAVCIAL 

PEDV WAKWPNAN WDSHYGFDKA GEFHMLDHTG FTFPSEVVNG RRVIKTTDNN CWVNVTCLQL 

TGEV QKVIRAIDID WQAHYGFRDA AAFSASSHDA YKFEWTHSN FIVHKQTDMN CWINAICLAL 

OV43 NLKQTFKSVL TTYYLDDVKK lEYKPDLSQY YCDGGKYYTQ RIIKAQFKTF EKVDGVYTNF 

BoCoV NLKQTFKSVL TTYYLDDVKK lEYKPDLSQY YCDGGKYYTQ RIIKAQFKTF EKVDGVYTNF 

MHV NLKQTFSSKL TTFYLDDVKC VEYNPDLSQY YCESGKYYTK PIIKAQFRTF EKVEGVYTNF 

AIBV TVVFVGSTNS GHCYTQAAGQ AFDNLAKDRK FGKKSPYITA MYTRFAFKNE TSLPVAKQSK 

SARS CoV ETSYTTTIKP VSYKLDGVTY TEIEPKLDGY YKKDNAYYTE QPIDLVP-TQ PLPNASFDNF 

I I I I I I 1 I I 1 I I 

1985 1995 2005 2015 2025 2035 

EMCR QYLKPTFKSK GLNVLWNKFV TGDVGPFVSF XYFITMSSKG QKGDAEEALS KLSEYLISDS 

229E QYSKPHFISQ GLDAAWNKFV LGDVEIFVAF VYYVARLMKG DKGDAEDTLT KLSKYLANEA 

PEDV QFARFRFKSA GLQAMWESYC TGDVAMFVHW LYWLTGVDKG QPSDSENALN MLSKYIVPAG 

TGEV QRLKPQWKFP GVRGLWNEFL ERKTQGFVHM LYHISGVKKG EPGDAELMLH KLGDLMDNDC 

OV43 KLIG— HTVC DSLNAKLGFD SSKEFVEYKI TEWPTATGDV VLATDDLYVK RYERGCITFG 

BoCoV KLIG — HTVC DILNAKLGFD SSKEFVEYKV TEWPTATGDV VLATDDLYVK RYERGCITFG 

MHV KLVG — HSIA EKFNAKLGFD CNSPFTEYKI TEWPTATGDV VLASDDLYVS RYSGGCVTFG 

AIBV GKSKS-VKED VSNLATSSKA SFDNLTDFEQ WYDSNIYESL KVQESPDNFD KYVSFTTKED 

SARS CoV KLTCSNTKFA DDLNQMTGFT KP-ASRELSV TFFPDLNGDV VAIDYRHYSA SFKKGAKLLH 
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EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



I I 

2045 
IVTLEQYSTC 
QVQLEHYSSC 
SVTIERVTHD 
EIIVTHTTAC 
KPVIWLSHEK 
KPVIWLSHEQ 
KPVIWLGHEE 
SKLPLTLKVR 
KPIVWHINQA 



1 



I 



I I 

2055 

Die 

VECDAKF 

GCC 

DKC 

ASLNSLT— 

ASLNSLT 

ASLKSLT 

GIKS 

TTKTTFKPNT WCLRCLWSTK 



2065 

KSTW 

KNSVA 

CSKR 

AKVE 

— YFNRP 
- — YFNRP 

YFNRP 

VV 



I I 

2075 
EVKSAVVCAS 
SINSAIVCAS 
WTAPWNAS 
KFVGPVVAAP 
SLVDDNKFDV 
LLVDENKFDV 
SWCENKFNV 
DFRSKDGFIY 
PVDTSNSFEV 



2085 2095 

VLKDGCOVG- 

VKRDGVQVG 

VLKLGVEDG- 

LAIHGTDE — 

LKVDDVD 

LKVDDVD 

LPVDVSEPTD KGPVPAAVLV 

KLTPDTD 

LAVEDTQGMD N 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS COV 



I I 

2105 

FCPHRH 

YCVHGI 

LCPHGL 

TCVHGV 

DGGDSS 

DGGDIS 

TGALSGAATA 



I 



2125 



I 



1 



I 



I . 



2135 



2145 



I 



I 



I 



2155 



LACESQ 

1 I 



2115 

KLRSRVK 

KYYSRVR 

NYIGKVV- — 
SVNVKVT— 

ESGAKE 

ESDAKE 

PGTAKEQKVC ASDSVVDQW SGFLSOLSGA TVDVKEVKLN GVKKPIKVED 

ENSKAPVY YPVLDAISLK 

QPTSEEVVEN PTIQKEVIE CDVKTTEWG 



-FVNGRVVIT NVGEPIISQP 
-SVRGRAIIV SVEQLEPCAQ 
-VVKGTTIW NVGKPVVAPS 
-QIKGTVAIT SLIGPIIG— 
TKEINIIKLS GVKKPFKVED 
PKEINIXKLS GVKKPFKVED 



2165 
SKLLNGIAYT 
SRLLSGVAYT 
HLFLKGVSYT 
-EVLEATGYI 
SVIVNDDTSE 
SVIVMDDTSE 
SWVNDPTSE 
AIWVEGNANF 
NVILKPSDEG 



1 I 

2175 
TFS — GSFDN 
AFS— GPVDK 
TFLDNGNGVV 
CYS— GSNRN 
TKYVKSLSIV 
IKYVKSLSIV 
TKVVKSLSIV 

VVG HP 

VKVTQELGHE 



I 



2185 
GHYVVYDAAN 
GHYTVYDTAK 
GHYTVFDHGT 
GHYTYYDNRN 
DVYDMWLTGC 
DVYDMHLTGC 
DVYDMFLTGC 
NYYSKSLHIP 
DLMAAYVENT 



2195 
NAVYDGARLF 
KSMYDGDRFV 
GMVHDGDAFV 
GLVVDAEKAY 
KYWRTANAL 
RCWRTANAL 
RYWWMANEL 
TFWENAENFV 
SITIKKPNEL 



I I 

2205 
ASDLSTLAVT 
KHDLSLLSVT 
PGDLNVSPVT 
HFNRDLLQVT 
SRAVNVPTIR 
SRAVNVPTIR 
SRLVNSPTVR 
KMGDKIGGVT 
SLALGLKTIA 



I I 

2215 
AIWVGGCVT 
SWMVGGYVA 
NWVSEQTAV 
TAIASNFWK 
KFIKFGMTLV 
KFIKFGMTLV 
EYVKWGMTKI 
MGLWRAEHLN 
THGIAAINSV 



I I I I I I I I I I 

2225 2235 2245 2255 2265 

EMCR S NVPP IVSEKISVMD KLDTG AQ KFFQFGDFVM 

229E PV NTVKPKPVIN QLDEK AQ KFFDFGDFLI 

PEDV V IKDP VKKAELDATK LLDTMNYASE RFFSFGDFMS 

TGEV KPQAEERPKN CAFNKVAASP KIVQEQKLLA lESGANYALT EFGRYADMFF 

OV43 SIP IDLL NLREIKPAVN WKAVRNKIS VCFNFIKWLF 

BoCoV SIP IDLL NLREIKPVFN WKAVRNKIS ACFNFIKWLF 

MHV VIP AKLV LLRDEKQEFV APKWKAKVI ACYSAVKWFF 

AIBV KPN LERI FNIAKKAIVG SSWTTQCGK LIGKAATFIA 

SARS CoV PWS KILA YVKPFLG — QAAITTSN CAKRLAQRVF 



I I 

2275 
NNIVLFLTWL 
HNFVIFFTWL 
RNLITVFLYI 
MAGDKILRLL 
VLLFGWIKIS 
VLLFGWIKIS 
LYCFSWIKFN 
DKVGGGWRN 
NNYMPYVFTL 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BOCOV 

MHV 

AIBV 

SARS COV 



2285 
LSMFSLLRTS 
LSMFTLCKTA 
LSILGLCFRA 
LEVFKYLLVL 
ADNKVIYTTE 
ADNKVIYTTE 
TDNKVIYTTE 
ITDSIKGLCG 
LFQLCTFTKS 

I I 



1 I 

2295 
IMKHDIKVIA 
VTTGDVKIMA 
FRKRDVKVLA 
FMCLRSTKMP 
lASKLTCKLV 
VASKLTCKLV 
VASKLTFNLC 
ITRGHFERKM 
TNSRIRASLP 



I I 

2305 
KAPKRTGVIL 
KAPQRTGVVL 
GVPQRTGIIL 
KVKVKP-PLA 
ALAFKNAFLT 
ALAFKNAFLT 
CLAFKNALQT 
SPQFLKTLMF 
TTIAKNSVKS 



I I 

2315 
TRSFKYNIRS 
KRSLKYNLKA 
RKSMRYNAKA 
FKDFGAKVRT 
FKWSMVARGA 
FKWSWARGA 
FNWNVVSRGF 
FLFYFLKASV 
VAKLCLDAGI 



\ I 

2325 
ALFVVKQKWC 
SAAVLKSKWW 
LGVFFKLKLY 
LNYMRQLNKP 
CIIATIFLLW 
CIIATIFLLW 
FLVATVFLLW 
KSVVASYKTV 
NYVKSPKFSK 



I I 

2335 
VIVTLFKFLL 
LLAKFTKLLL 
WFKVLGKFSL 
SVWRYAKLVL 
FNFIYANVIF 
FNFIYANVIF 
FNFLYANVIL 
LCKWLATLL 
LFTIAMWLLL 



2345 
LLYAIYALVF 
LIYTLYSVVL 
GIYALYALLF 
LLIAIYNFFY 
SDFYLPKIGF 
SDFYLPKIGF 
SDFYLPNIGF 
IVWFVYTSNP 
LSICLGSLIC 



I I 

2355 
MIVQFSPFNS 
LCVRFGPFN- 
MTIRFTPIGS 
LFVSIPWHK 
LPTFVGKIAQ 
LPTFVGKIVQ 
FPTFVGQIVA 
VMFTGIRVLD 
VTAAFGVLLS 



I I 

2365 
LLCGDIVSGY 
-FCSETVNGY 
PVCDDWAGY 
LTCNGAVQAY 
WIKNTFSLVT 
WIKNTFSLVT 
WVKTTFGIFT 
FLFEGSLCGP 
NFGAPSYCN6 



2375 

EKSTFN 

AKSNFV 

ANSSFD 

KNSSFI 

ICDLYSMQDV 
ICDLYSIQDV 
LCDLYQVSDV 
YKDYGK— DS 
VRELYLNSSN 



f I 

2385 
— KDIYCGNS 
— KDDYCDGS 
— KNEYCN-S 
— KSAVCGNS 
GFKNQYCNGS 
GFKNQYCNGS 
GYRSSFCNGS 
FDVLRYCADD 
VTTMDFCEGS 



2395 
MVCKMCLFSY 
LGCKMCLFGY 
VICKVCLYGY 
ILCKACLASY 
lACQFCLAGF 
lACQFCLAGF 
MVCELCFSGF 
FICRVCLHDK 
FPCSICLSGL 



I I I I I I I I 1 I I I 

2405 2415 2425 2435 2445 2455 

EMCR QEFNDLDHTS LVWKHIR D— P — -ILISLQPFV ILVILLIFGN MYLRFGLLYF 

229E QELSQFSHLD WWKHIT D— P LFSNMQPFI VMVLLLIFGD NYLRCFLLYF 

PEDV QELSDFSHTQ VVWQHLR D— P LIGNVMPFF YLAFLAIFGG VYVKAITLYF 

TGEV DELADFQHLQ VTWDFKS D— P LWNRLVQLS YFAFLAVFGN NYVRCFLMYF 

OV43 DMLDNYKAID VVQYEADRRA FVDYTGVLKI VIELIVSYAL YTAWFYPLFA LISIQILTTW 

BoCoV DMLDNYKAID VVQYEADRRA FVDYTGVLKI VIELIVSYAL YTAWFYPLFA LISIQILTTW 

MHV DMLDNYDAIN VVQHWDRRV SFDYISLFKL VVELVIGYSL YTVCFYPLFG LIGMQLLTTW 

AIBV DSLHLYKHAY SVEQVYKDAA SG — FIFNWNWL YLVFLILFVK PVAGFVIICY 

SARS CoV DSLDSYPALE TIQVTIS — S YKLDLTILGL AAEWVLAYML FTKFFYLLGL SAIMQVFFGY 
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I I I. ...I 1 I I I I I 

2465 2475 2485 2495 2505 2515 

EMCR VAQFISTFG SFLGFHQKQ WFLHFVPFDV LCNEFLATFI VCKIVLFVRH IIVGCNNADC 

229E VAQMISTVG VFLGYKETN WFLHFIPFDV ICDELLVTVI VIKVISFVRH VLFGCENPDC 

PEDV IFQYLNSLG VFLGLQQSI WFLQLVPFDV FGDEIWFFI VTRVLMFIKH VCLGCDKASC 

TGEV VSQYLNLWL SYFGYVEYS WFLHWNFES ISAEFVIWI WKAVLALKH IVFACSNPSC 

OV43 LPELFMLST- -LHWSFRLLV ALANMLPAHV FMRFYIIIAS FIKLFSLFRH VAYGCSKSGC 

BoCoV LPELLMLST- -LHWSVRLLV SLANMLPAHV FMRFYIIIAS FIKLFSLFRH VAYGCSKSGC 

MHV LPEFFMLET- -MHWSARFFV FVANMLPAFT LLRFYIVVTA MYKIFCLCRH VMYGCSRPGC 

AIBV CVKYLVLNST VLQTGVCFLD WFVQTVFSHF NFMGAGFYFW LFYKIYIQVH HILYCKDVTC 

SARS CoV FASHFISN — — SWLMWFII SIVQMAPVSA MVRMYIFFAS FYYIWKSYVH IMDGCTSSTC 

1 I I I I I I I I I I I 

2525 2535 2545 2555 2565 2575 

EMCR VACSKSARLK RVPLQTIING MHKSFYVNAN GGTCFCNKHN FFCVNCDSFG PGNTFINGDI 

22 9E lACSKSARLK RFPVNTIVNG VQRSFYVNAN GGSKFCKKHR FFCVDCDSYG YGSTFITPEV 

PEDV VACSKSARLK RVPVQTIFQG TSKSFYVHAN GGSKFCKKHN FFCLNCDSYG PGCTFINDVI 

TGEV KTCSRTARQT RIPIQVWNG SMKTVYVHAN GTGKFCKKHN FYCKNCDSYG FENTFICDEI 

OV4 3 LFCYKRNRSL RVKCSTIVGG MIRYYDVMAN GGTGFCSKHQ WNCIDCDSYK PGNTFXTVEA 

BoCoV LFCYKRNRSL RVKCSTIVGG MIRYYDVMT^ GGTGFCSKHQ WNCIDCDSYK PGNTFITVEA 

MHV LFCYKRNRSV RVKCSTWGG TLRYYDVMAN GGTGFCAKHQ WNCLNCSAFG PGNTFITHEA 

AIBV EVCKRVARSN RQEVSVVVGG RKQIVHVYTN SGYNFCKRHN WYCRNCDDYG HQNTFMSPBV 

SARS CoV MMCYKRNRAT RVECTTIVNG MKRSFYVYAN GGRGFCKTHN WNCLNCDTFC TGSTFISDEV 

I I I I 1 1 I ) 1 I I I 

2585 2595 2605 2615 2625 2635 

EMCR ARELGNVVKT AVQPTAPAYV IIDKVDFVNG FYRLYSGDTF WRYDFDITES KYSCKE 

22 9E SRELGNITKT NVQPTGPAYV MIDKVEFENG FYRLYSCETF WRYNFDITES KYSCKE 

PEDV ATEVGNWKL NVQPTGPATI LIDKVEFSNG FYYLYSGDTF WKYNFDITDS KYTCKE 

TGEV VRDLSNSVKQ TVYATDRSHQ EVTKVECSDG FYRFYVGDEF TSYDYDVKHK KYSSQE 

OV43 ALDLSKELKR PIQPTDVAYH TVTDVKQVGC SMRLFYDRDG QRTYDDVNAS LFVDYSNLLH 

BoCoV ALDLSKELKR PIQPTDVAYH TVTDVKQVGC YMRLFYDRDG QRTYDDVNAS LFVDYSNLLH 

MHV AADLSKELKR PVNPTDSAYY LVTEVKQVGC SMRLFYERDG QRVYDDVSAS LFVDMNGLLH 

AIBV AGELSERLKR HVKPTAYAYH WDEACLVDO FVNLKYKAAT PGKDSASSAV KCFSVTDFLK 

SARS CoV ARDLSLQFKR PINPTDQSSY IVDSVAVKNG ALHLYFDKAG QKTYERHPLS HFVNLDNLRA 

\ I I I I I ! I I I I I 

2645 2655 2665 2675 2685 2695 

EMCR -VLKNCNVLE NFIVYNN SGSNI TQIKNACVYF SQLLCEPIKL VNSELLSTLS 

229E -VFKNCNVLD DFIVFNN NGTNV TQVKNASVYF SQLLCRPIKL VDSELLSTLS 

PEDV -ALKNCSilT DFIVFNN NGSNV NQVKNACVYF SQMLCKPVKL VDSALLASLS 

TGEV -VLKSMLLLD DFIVYSP SGSAL ANVRNACVYF SQLIGKPIKI VNSDLLEDLS 

OV43 SKVKSVPNMH VWVEN DADK ANFLNAAVFY AQSLFRPILM VDKNLITTAN 

BoCoV SKVKSVPNMH VWVEN DADK ANFLNAAVFY AQSLFRPILM VDKILITTAN 

MHV SKVKGVPETH WVVEN EADK AGFLNAAVFY AQSLYRPMLL VEKKLITTAN 

AIBV KAVFLKEALK CEQISNDGFI VCNTQSAHAL EEAKNAAIYY AQYLCKPILI LDQALYEQLV 

SARS COV NNTKGSLPIN VIVFDGK SKCDE SASKSASVYY SQLMCQPILL LDQVLVSDVG 

I I I I I I I I I I 1 1 

2705 2715 2725 2735 2745 2755 

EMCR — VDFNGVLH KAYVDVLCNS FFKELTANMS MAECKATLGL T 

229E —VDFNGVLH KAYIDVLRNS FGKDLNANMS LAECKRALGL S 

PEDV — VDFGASLH SAFVSVLSNS FGKDLSSCND MQDCKSTLGF DD 

TGEV — VDFKGALF NAKKNVIKNS FNVDVSECKN LDECYRACNL N 

OV43 TGTSVTETMF DVYVDTFLSM FDVDKKSLNA LIATAHSSIK QGTQIYKVLD TFLSCARKSC 

BoCoV TGTSVTETMF DVYVDTFLSM FDVDKKSLNA LIATAHSSIK QGTQICKVLD TFLSCARKSC 

MHV TGLSVSQTMF DLYVDSLLGV LDVDRKSLTS FVNAAHNSLK EGVQLEQVMD TFIGCARRKC 

AIBV V-EPVSKSVI DKVCSILSSI ISVDTAALNY KAGTLRDALL S 

SARS CoV DSTEVSVKMF DAYVDTFSAT FSVPMEKLKA LVATAHSELA KGVALDGVLS TFVSAARQG- 

I I I I I I I I I I I. ...I 

2765 2775 2785 2795 2805 2815 

EMCR VSDDD FVSAVANAHR YDVLLSDLSF NNFFISYAKP EDK-LSVYDI ACCMRAGSKV 

229E ISDHE FTSAISNAHR CDVLLSDLSF NNFVSSYAKP EEK-LSAYDL ACCMRAGAKV 

PEDV VPLDT FNAAVAEAHR YDVLLTDMSF NNFTTSYAKP EEK-FPVHDI ATCMRVGAKI 

TGEV VSFST FEMAVNNAHR FGILITDRSF NNFWPSKVKP GSSGVSAMDI GKCMTSDAKI 

OV43 SIDSDVDTKC LADSVMSAVS AGLELTDESC NNLVPTYLKS DN — IVAADL GVLIQNSAKH 

BoCoV SIDSDVDTKC LADSVMSAVS AGLELTDESC NNLVPTYLKG DN — IVAADL GVLIQNSAKH 

MHV AIDSDVETKS ITKSIMSAVN AGVDFTDESC NNLVPTYVKS DT — IVAADL GVLIQNNAKH 

AIBV ITKDEE AVDMAIFCHN HDVDYTGDGF TNVIPSYGID TG-KLTPRDR GFLINADASI 

SARS CoV VVDTDVDTKD VIECLKLSHH SDLEVTGDSC NNFMLTYNKV EN — MTPRDL GACIDCNARH 

I I I I I I I I I I I I 

2825 2835 2845 2855 2865 2875 

EMCR VNHNVLIKES IPIVWGVKDF NTLSQEGKKY LVKTTKAKGL TFLLTFNDNQ AITQVP 

229E VNANVLTKDQ TPIVWHAKDF NSLSAEGRKY IVKTSKAKGL TFLLTINENQ AVTQIP 

PEDV VNHNVLVKDS IPVVWLVRDF lALSEETRKY IIRTTKVKGI TFMLTFNDCR MHTTIP 

TGEV VNAKVLTQRG KSVVWLSQDF AALSSTAQKV LVKTFVEEGV NFSLTFNAVG SDDDLPYERF 

OV43 VQGNVAKIAG VSCIWSVDAF NQFSSDFQHK LKKACCKTGL KLKLTYNKQM ANVSVLT 

BoCoV VQGNVAKIAG VSCIWSVDAF NQLSSDFQHK LKKACCKTGL KLELTYNKQM ANVSVLT 

MHV VQANVAKAAN VACIWSVDAF NQLSADLQHR LRKACSKTGL KIKLTYNKQE ANVPILT 

AIBV ANLRVKN--A PPVVWKFSEL IKLSDSCLKY LISATVKSGV RFFITKSGAK QVIACHT 

SARS CoV INAQVAKSHN VSLIWNVKDY MSLSEQLRKQ IRSAAKKNNI PFRLTCATTR QVVNVIT 
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2885 2895 2905 2915 2925 2935 

EMCR ATSIVAKQGA G FKRTYNFL WYVCLFVVAL FIGVSFID YTTTVTS 

229E ATSIVAKQGA GD AGHSLTWL WLLCGLVCLI QFYLCFFMPY — FMYDIVSS 

PEDV TVCIANKKGA GLPS FSKVKKFP WFLCLFIVAA FFALSFLD FSTQVSS 

TGEV TESVSPKSGS G FFDVITQL KQIVILVFVF IFICGLCSVY SVATQSYIES 

OV43 — TPFSLKGG A — V — FSYFVYVC FVLSLVCFIG LWCLMPT YTVHKSDFQL 

BoCoV — TPFSLKGG A — V — FSYFVYVC FVLSLVCFIG LWCLMPT YTVHKSDFQL 

MHV —TPFSLKGG A — V FSKVLQWL FWNLICFIV LWALMPT YAVHKSDMQL 

AIBV QKLLVEKKAG GIVSGTFKCF KSYFKWLLIF YILFTACCSG YYYMEVSKSF VHPMYDVNST 
SARS CoV — TKISLKGG K — I VSTCFKLM LKATLLCVLA ALVCYIVMPV HTLSIHDGYT 

....I. ...I ..I ....1 I ..I I. ...I 

2945 2955 2965 2975 2985 2995 

EMCR FHGYDFKYIE NGQLKVFEAP LHCVRNVFDN FNQWHEAKFG WTTNSDKCP IWG VS 

22 9E FEGYDFKYIE NGQLKNFEAP LKCVRNVFEN FEDWHYAKFG FTPLNKQSCP IVVG VS 

PEDV DSDYDFKYIE SGQLKTFDNP LSCVHNVFIN FDQWHDAKFG FTPVNNPSCP IVVG VS 

TGEV AEGYDYMVIK NGIVQPFDDT ISCVHNTYKG FGDWFKAKYG FIPTFGKSCP IVVGT-VFDL 

OV43 PVYASYKVLD NGVIRDVSVE DVCFANKFEQ FDQWYESTFG LSYYSNSMAC PIVVA-VIDQ 

BoCoV PVYASYKVLD NGVIRDVSVE DVCFANKFEQ FDQWYESTFG LSYYSNSMAC PIVVA-VVDQ 

MHV PLYASFKVID NGVLRDVTVT DACFANKFIQ FDQWYESTFG LVYYRNSRAC PVWA«VIDQ 

AIBV LHVEGFKVID KGVLREIVPE DTCFSNKFVN FDAFWGRPYD NSRNCPIVTA VIDGDGTVAT 

SARS CoV NEIIGYKAIQ DGVTRDIIST DDCFANKHAG FDAWFSQRGG S — YKNDKSC PVVAA-IITR 

I I I I I I ....I I I I 1 I 

3005 3015 3025 3035 3045 3055 

EMCR ERINWPGVP TNVYLVG -KTLVFTLQA AFGNTGVCYD FDGVTTS — DKCIFNSA 

229E EIVNTVAGIP SNVYLVG -KTLIFTLQA AFGNAGVCYD IFGVTTP — EKCIFTSA 

PEDV DEARTVPGIP AGVYLAG KTLVFAINT IFGTSGLCFD ASGVADK — GACIFNSA 

TGEV ENMRPIPDVP AYVSIVG RSLVFAINA AFGVTNMCYD HTGNAVSKDS YFDTCVFNTA 

OV43 DFGSTVFNVP TKVLRYG YHVLHFITHA LSADGVQCYT PHSQISYSNF YASGCVLSSA 

BoCoV DFGSTVFNVP TKVLRYG YHVLHFITHA LSADGVQCYT PHSQISYSNF YASGCVLSSA 

MHV DIGYTLFNVP TKVLRYG FHVLHFITHA FATDSVQCYT PHMQIPYDNF YASGCVLSSL 

AIBV GVPGFVSWVM DGVMFIHMTQ TERKPWYIPT WFNREIVGYT QDSIITEGSF YTSIALFSAR 

SARS CoV EIGFIVPGLP GTVLRAIN — GDFLHFLPRV FSAVGNICYT PSKLIEYSDF ATSACVLAAE 

I I I I I I I 1 I I 1 I 

3065 3075 3085 3095 3105 3115 

EMCR CTRLEGLGGD -NVYCYNTDL lEGSKPYSIL QPNAYYKYDV K-NYVRFPEI LARGFGLRTI 

229B CTRLEGLGGN -NVYCYNTAL MEGSLPYSSI QANAYYKYDN G-NFIKLPEV lAQGFGFRTV 

PEDV CTTLSGLGGT -AVYCYKNGL VEGAKLYSEL APHSYYKMVD G-NAVSLPEI ISRGFGIRTI 

TGEV CTTLTGLGGT -IVYCAKQGL VEGAKLYSDL MPDYYYEHAS G-NMVKLPAI IR-GLGLRFV 

OV43 CTMFTMADGS PQPYCYTEGL MQNASLYSSL VPHVRYNLAN AKGFIRFPEV LREGL-VRIV 

BoCoV CTMFAMADGS PQPYCYTDGL MQNASLYSSL VPHVRYNLAN AKGFIRLPEV LREGL-VRIV 

MHV CTMLAHADGT PHPYCYTEGI MHNASLYDSL APHVRYNLAN SNGYIRFPEV VSEGI-VRIV 

AIBV CLYLTASNTP QLYCFNGDND APGALPFGSI IPHRVYFQPN GVRLIVPQQI LHTPY W 

SARS COV CTIFKDAMGK PVPYCYDTNL LEGSISYSEL RPDTRYVLMD G-SIIQFPNT YLEGS-VRW 

I I I I I I I I I I I I 

3125 3135 3145 3155 3165 3175 

EMCR RTLATRYCRV GECRDSHKGV CFGFDKWYVN DGRVD DG YICGDGLIDL LVNVLSIFSS 

22 9E RTIATKYCRV GECVESNAGV CFGFDKWFVN DGRVA NG YVCGTGLWNL VFNILSMFSS 

PEDV RTKAMTYCRV GQCVQSAEGV CFGADRFFVY NAESG SD FVCGTGLFTL LMNVISVFSK 

TGEV KTQATTYCRV GECIDSKAGF CFGGDNWFVY DNEFG NG YICGNSVLGF FKNVFKLFNS 

OV43 RTRSMSYCRV GLCEEADEGI CFNFNGSWVL NNDYYRSLPG TFCGRDVFDL lYQLFKGLAQ 

BoCoV RTRSMSYCRV GLCEEADEGI CFNFNGSWVL NNDYYRSLPG TFCGRDVFDL lYQLFKGLAQ 

MHV RTRSMTYCRV GLCEDAEEGV CFNFNSSWVL NNPYYRAMPG TFCGRNAFDL IHQVLGGLVR 

AIBV KFVSDSYCRG SVCEYTRPGY CVSLNPQWVL FNDEYTSKPG VFCGSTVREL MFSMVSTFFT 

SARS CoV TTFDAEYCRH GTCERSEVGI CLSTSGRWVL NNEKYRALSG VFCGVDAMNL lANIFTPLVQ 

I I I I I I I I I I I I 

3185 3195 3205 3215 3225 3235 

EMCR SFSVVAMSGH MLFNFLFAAF ITFLCFLVTK FKRVFGDLSY GVFTVVCATL INNISYVVTQ 

229E SFSVAAMSGQ ILLNCALGAF AIFCCFLVTK FRRMFGDLSV GVCTVVVAVL LNNVSYIVTQ 

PEDV TVPVTVLSGQ ILFNCIIAFV AVAVCFLFTK FKRMFGDMSV GVFTVGACTL LNNVSYIVTQ 

TGEV NMSWATSGA MLVNIIIACL AIAMCYGVLK FKKIFGDCTF LIVMIIVTLV VNNVSYFVTQ 

OV43 PVDFLALTAS SIAGAILAVI WLVFYYLIK LKRAFGDYTS VVFVNVIVWC VNFMMLFVFQ 

BoCoV PVDFLALTAS SIAGAILAVI WLGFYYLIK LKRAFGDYTS IVFVNVIVWC VNFMMLFVFQ 

MHV PIDFFALTAS SVAGAILAII WLAFYYLIK LKRAFGDYTS VWINVIVWC INFLMLFVFQ 

AIBV GVN-PNIYMQ LATMFLILW WHFAMVIK FQGVFKAYAT TVFITMLVWV INAFILCVHS 

SARS COV PVGALDVSAS WAGGIIAIL VTCAAYYFMK FRRVFGEYNH VVAANALLFL MSFTILCLVP 

I I I I ...,|....| I I 1 I 1 I 

3245 3255 3265 3275 3285 3295 

EMCR N-LFFMLLYA ILYFVFTRTV R— YAWIWHI AYIVAYFLLI PWWLLTWFSF AAFLELLPNV 

229E N-LVTMIAYA ILYFFATRSL R— YAWIWCA AYLIAYISFA PWWLCAWYFL AMLTGLLPSL 

PEDV N-TLQJLGYA TLYFLCTKGV R— YMWIWHL GFLISYILIA PWWVLMVYAF SAIFEFMPNL 

TGEV N-TFFMIIYA IVYYFITRKL A— YPGILDA GFIIAYINMA PWYVITAYIL VFLYDSLPSL 

OV43 VYPILSCVYA ICYFYATLYF PSEISVIMHL QWLVMYGTIM PLWFCLLYIA VVVSNHAFWV 

BoCoV VYPTLSCVYA ICYFYATLYF PSEISVIMHL QWLVMYGTIM PLWFCLLYIS VVVSNHAFWV 

MHV VYPTLSCLYA CFYFYTTLYF PSEISWMHL QWLVMYGAIM PLWFCIIYVA VWSNHALWL 

AIBV YNSVLAVILL VLYCYASLVT SRNTVIIMHC WLVFTFGLIV PTWLACCYLG FIIYMYTPLF 

SARS COV AYSFLPGVYS VFYLYLTFYF TNDVSFLAHL QWFAMFSPIV PFWITAIYVF CISLKHCHWF 
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I 1 I I I I I I I I I I 

3305 3315 3325 3335 3345 3355 

EMCR FKLKISTQ LFEGDKFI GTFESAAAGT FVLDMRSYER LIKT— ISPE KLKNYAASYN 

229E LKLKVSTN LFEGDKFV GTFESAAAGT FVIDMRSYEK LANS — ISPE KLKSYAASYN 

PEDV FKLKVSTQ LFEGDKFV GSFENAAAGT FVLDMHAYER LANS— ISTE KLRQYASTYN 

TGBV FKLKVSTN LFEGDKFV GNFESAAMGT FVIDMRSYET IVNS — TSIA RIKSYANSFN 

OV43 FSYCRKLG — — TSVR — SD GTFEEMALTT FMITKDSYCK LKNS — LSDV AFNRYLSLYN 

BOCOV FSYCRQLG TSVR— SD GTFEEMALTT FMITKDSYCK LKNS— LSDV AFNRYLSLYN 

MHV FSYCRKLG — — TEVR— SD GTFEEMSLTT FMITKESYCK LKNS— VSDV AFNRYLSLYN 

AIBV LWCYGTTKNT RKLYDGNEFV GNYDLAAKST FVIRGSEFVK LTNE IGD KFEAYLSAYA 

SARS COV FNNYLRKR — — VMFNGVTF STFEEAALCT FLLNKEMYLK LRSETLLPLT QYNRYLALYN 



I I I I I I I I I I I I 

3365 3375 3385 3395 3405 3415 

EMCR KYKYYSGSAS EADYRCACYA HLAKAMLDYA -KDHNDMLYS PPTISYN-ST LQSGLKKMAQ 

229E RYKYYSGNAN EADYRCACYA YLAKAMLDFS -RDHNDILYT PPTVSYG-ST LQAGLRKMAQ 

PEDV KYKYYSGSAS EADYRLACFA HLAKAMMDYA -SNHNDTLYT PPTVSYN-ST LQAGLRKMAQ 

TGEV KYKYYTGSMG EADYRMACYA HLGKALMDYS -VNRTDMLYT PPTVSVN-ST LQSGLRKMAQ 

OV43 KYRYYSGKMD TAAYREAACS QLAKAMDTFT NNNGSDVLYQ PPTASVSTSF LQSGIVKMVN 

BoCoV KYRYYSGKMD TAAYREAACS QLAKAMDTFT NNNGSDVLYQ PPTASVSTSF LQSGIVKMVN 

MHV KYRYFSGKMD TAAYREAACS QLAKAMETFN HNNGNDVLYQ PPTASVTTSF LQSGIVKMVF 

AIBV RLKYYSGTGS EQDYLQACRA WLAYALDQYR -NSGVEIVYT PPRYSIGVSR LQSGFKKLVS 

SARS Gov KYKYFSGALD TTSYREAACC HLAKALNDFS -NSGADVLYQ PPQTSITSAV LQSGFRKMAF 



I I I I I I I I I I I I 

3425 3435 3445 3455 3465 3475 

EMCR PSGCVERCVV RVCYGSTVLN GVWLGDTVTC PRHVIAPSTT VL-IDYDHAY STMRLHNFSV 

22 9E PSGFVEKCVV RVCYGNTVLN GLWLGDIVYC PRHVIASNTT SA-IDYDHEY SIMRLHNFSI 

PEDV PSGWEKCIV RVCYGNMALN GLWLGDIVMC PRHVIASSTT ST-IDYDYAL SVLRLHNFSI 

TGEV PSGLVEPCIV RVSYGNNVLN GLWLGDEVIC PRHVIASDTT RV-INYENEM SSVRLHNFSV 

OV43 PTSKVEPCW SVTYGNMTLN GLWLDDKVYC PRHVICSASD MTNPDYTNLL CRVTSSDFTV 

BoCoV PTSKVEPCIV SVTYGNMTLN GLWLDDKVYC PRHVICSASD MTNPDYTNLL CRVTSSDFTV 

MHV PTSKVEPCW SVTYGNMTLN GLWLDDKVYC PRHVICSSAD MTDPDYSNLL CRVISSDFCV 

AIBV PSSAVEKCIV SVSYRGNNLN GLWLGDTIYC PRHVLGKFSG DQ WNDVL NIANNHEFEV 

SARS CoV PSGKVEGCMV QVTCGTTTLN GLWLDDTVYC PRHVICTAED MLNPNYEDLL IRKSNHSFLV 



I I I I I I I I I 1 I I 

3485 3495 3505 3515 3525 3535 

EMCR SHNG-VFLGV VGVTMHGSVL RIKVSQSNVH TPKHVFKTLK PGASFNILAC YEGIASGVFG 

229E ISGT-AFLGV VGATMHGVTL KIKVSQTNMH TPRHSFRTLK SGEGFNILAC YDGCAQGVFG 

PEDV SSGN'VFLGV VSATMRGALL QIKVNQNNVH TPKYTYRTVR PGESFNILAC YDGAAAGVY6 

TGEV SKNN-VFLGV VSARYKGVNL VLKVNQVNPN TPEHKFKSIK AGESFNILAC YEGCPGSVYG 

OV43 LFDR-LSLTV MSYQMRGCML VLTVTLQNSR TPKYTFGVVK PGETFTVLAA YNGKPQGAFH 

BoCoV LFDR-LSLTV MSYQMQGCML VLTVTLQNSR TPKYTFGVVK PGETFTVLAA YNGKPQGAFH 

MHV MSGR-MSLTV MSYQMQGSLL VLTVTLQNPN TPKYSFGWK PGETFTVLAA YNGKSQGAFH 

AIBV TTQHGVTLNV VSRRLKGAVL ILQTAVANAE TPKYKFIKAN CGDSFTIACA YGGTVVGLYP 

SARS CoV QAGN-VQLRV IGHSMQNCLL RLKVDTSNPK TPKYKFVRIQ PGQTFSVLAC YNGSPSGVYQ 

I 1 I I I I I I I I 

3545 3555 3565 3575 3585 3595 

EMCR VNLRTNFTIK GSFINGACGS PGYNVRNDGT VEFCYLHQIE LGSGAHVGSD FTGSVYGNFD 

229E VNMRTNWTIR GSFINGACGS PGYNLKN-GE VEFVYMHQIE LGSGSHVGSS FDGVMYGGFE 

PEDV VNMRSNYTIR GSFINGACGS PGYNINN-GT VEFCYLHQLE LGSGCHVGSD LDGVMYGGYE 

TGEV VNMRSQGTIK GSFIAGTCGS VGYVLEN-GI LYFVYMHHLE LGNGSHVGSN FEGEMYGGYE 

OV43 VTMRSSYTIK GSFLCGSCGS VGYVIMG-DC VKFVYMHQLE LSTGCHTGTD FNGDFYGPYK 

BoCoV VTMRSSYTIK GSFLCGSCGS VGYVIMG-DC VKFVYMHQLE LSTGCHTGTD FNGDFYGPYK 

MHV VTMRSSYTIK GSFLCGSCGS VGYVLTG-DS VRFVYMHQLE LSTGCHTGTD FSGNFYGPYR 

AIBV VTMRSNGTIR ASFLAGACGS VGFNIEK-GV VNFFYHHHLE LPNALHTGTD LMGEFYGGYV 

SARS CoV CAMRPNHTIK GSFLNGSCGS VGFNIDY-DC VSFCYMHHME LPTGVHAGTD LEGKFYGPFV 



1 I 1 I I 1 I I I I I I 

3605 3615 3625 3635 3645 3655 

EMCR DQPSLQVESA NLMLSDNWA FLYAALLNGC R WWL RSTRVNVDGF NEWAMANGYT 

229E DQPNLQVESA NQMLTVNWA FLYAAILNGC T WWL KGEKLFVEHY NEWAQANGFT 

PEDV DQPTLQVEGA SSLFTENVLA FLYAALINGS T WWL SSSRIAVDRF NEWAVHNGMT 

TGEV DQPSMQLEGT NVMSSDNVVA FLYAALINGE R WFV TNTSMSLESY NTWAKTNSFT 

OV43 DAQVVQLLIQ DYIQSVNFVA WLYAAILNNC N WFV QSDKCSVEDF NVWALSNGFS 

BoCoV DAQWQLPVQ DYIQSVNFVA WLYAAILNNC N WFV QSDKCSVEDF NVWALSNGFS 

MHV DAQWQLPVQ DYTQTVNWA WLYAAILNRC N WFV QSDSCSLEEF NVWAMTNGFS 

AIBV DEEVAQRVPP DNLVTNNIVA WLYAAIISVK ESSFSLPKWL ESTTVSVDDY NKWAGDNGFT 
SARS COV DRQTAQAAGT DTTITLNVLA WLYAAVINGD R WFL NRFTTTLNDF NLVAMKYNYE 



I I I I I I I I I I 1 I 

3665 3675 3685 3695 3705 3715 

EMCR IVSSVEC— Y SILAAKTGVS VEQLLASIQH LHE-GFGGKN ILGYSSLCDE FTLAEWKQM 

229E AMNGEDA — F SILAAKTGVC VERLLHAIQV LNN-GFGGKQ ILGYSSLNDE FSINEWKQM 

PEDV TVGNTDC — F SILAAKTGVD VQRLLASIQS LHK-NFGGKQ ILGHTSLTDE FTTGEWRQM 

TGEV ELSSTDA — F SMLAAKTGQS VEKLLDSIVR LNK-GFGGRT ILSYGSLCDE FTPTEVIRQM 

OV43 QVKSDLV — I DALASMTGVS LETLLAAIKR LKN-GFQGRQ IMGSCSFEDE LTPSDVYQQL 

BoCoV QVKSDLV--I DALASMTGVS LETLLAAIKR LKN-GFQGRQ IMGSCSFEDE LTPSDVYQQL 

MHV SIKADLV — L DALASMTGVT VEQILAAIKR LYS-GFQGKQ ILGSCVLEDE LTPSDVYQQL 

AIBV PFSTSTA — I TKLSAITGVD VCKLLRTIMV KNS-QWGGDP ILGQYNFEDE LTPESVFNQI 

SARS COV PLTQDHVDIL GPLSAQTGIA VLDMCAALKE LLQNGMNGRT ILGSTILEDE FTPFDWRQC 



SUBSTITUTE SHEET (RULE 26) 
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I I I I I 1 I I I 1 I I 

3725 3735 3745 3755 3765 3775 

EMCR YGVNLQS GKVIFGLKTM FLFSVFFTMF WAELFIYTNT IWINPVILTP IFCLLLFLSL 

229E FGVNLQS GKTTSMFKSI SLFAGFFVMF WAELFVYTTT IWVNPGFLTP FMILLVALSL 

PEDV YGVNLQG GYVSRACRNV LLVGSFLTFF WSELVSYTKF FWVNPGYVTP MFACLSLLSS 

TGEV YGVNLQA GKVKSFFYPI MTAMTILFAF WLBFFMYTPF TWINPTFVSI VLAVTTLIST 

OV4 3 AGIKLQSKRT RLFKGTVCWI MASTFLFSCI ITAFVKWTMF MYVTTNMFS- ITFCALCVIS 

BoCoV AGIKLQSKRT RLVKGIVCWI MASTFLFSCI ITAFVKWTMF MYVTTNMLS- ITFCALCVIS 

MHV AGVKLQSKRT RWKGTCCWI LASTLLFCSI ISAFVKWTMF MYVTTHMLG- VTLCALCFVS 

AIBV GGVRLQS -SFVRKATSW FWSRCVLACF LFVLCAIVLF TAVPLKFYVY AAVILLMAVL 

SARS COV SGVTFQGKFK KIVKGTHHWM LLTFLTSLLI LVQSTQWSLF FFVYENAFLP FTLGIMAIAA 



I I I I |....( I I I I I I 

3785 3795 3805 3815 3825 3835 

EMCR VLTMFLKHKF LFLQVFLLPT VIATALYN CVLDYYIV KFLADHFN-Y NVSVLQMDVQ 

229E CLTFVVKHKV LFLQVFLLPS IIVAAIQN — — CAWDYHVT KVLAEKFD-Y NVSVMQMDIQ 

PEDV LLMFTLKHKT LFFQVFLIPA LIVTSCIN — — LAFDVEVY NYLAEHFD-Y HVSLMGFNAQ 

TGEV VFVSGIKHKM LFFMSFVLPS VILVTAHN — — LFWDFSYY ESLQSIVENT NTMFLPVDMQ 

OV43 LAMLLVKHKH LYLTMYITPV LFTLLYNNY- -LWYKHTFR GYVYAWLSYY VPSVEYTYTD 

BoCoV LAMLLVKHKH LYLTMYIIPV LFTLLYNNY- -LWYKQTFR GYVYAWLSYY VPSVEYTYTD 

MHV FAMLLVKHKH LYLTMFIMPV LCTLFYTNY LWYKQSFR GLAYAWLSHF VPAVDYTYMD 

AIBV FISFTVKHVM AYMDTFLLPT LITVIIGVCA EVPFIYNTLI SQWIFLSQW YDPWFDTMV 

SARS CoV CAMLLVKHKH AFLCLFLLPS LATVAYFN MVYMPASWV MRIMTWLELA DTSLSGYRLK 



1 I I I I I I I I I I I 

3845 3855 3865 3875 3885 3895 

EMCR GLVNVLVCLF WFLH TW RFSKERFTHW FTYVCSLIAV AYTYFYSGD- F 

22 9E GFVNIFICLF VALLH TW RFAKERCTHW CTYLFSLIAV LYTALYSYD- Y 

PEDV GLVNIFVCFV VTILHGTYTW RFFN-TPASS VTYWALLTA AYNYFYASD- 1 

TGEV GVMLTVFCFI VFVTYSVRFF TCKQSWFSLA VTTILVIFNM VKIFGTSDEP WTENQIAFCF 

OV43 EVIYGMLLLV GMVFVTLRSI NHDLFSFIMF VGRLISVFSL WYKGSNLEEE I 

BoCoV EVIYGMLLLI GMVFVTLRSI NHDLFSFIMF VGRVISWSL WYMGSNLEEE I 

MHV EVLYGWLLV AMVFVTMRSI NHDVFSVMFL VGRLVSLVSM WYFGANLEEE V 

AIBV PWMFLPLVLY TAFKCVQGCY MNSFNTSLLM LYQFVKLGFV lYTSSNTLTA YTEGNWELFF 
SARS CoV DCVMYASALV LLILMTARTV YDDAARRVWT LMNVITLVYK VYYGNALDQA 1 

I I I I I I I I I I I I 

3905 3915 3925 3935 3945 3955 



EMCR LSLLVMFLCA ISSDWYIGAI VFRLSRLIIF FSPE— SVFS VFGDVKLTLV VYLICGYLVC 

229E VSLLVMLLCA ISNEWYIGAI IFRICRFGVA FLPV — EYVS YFDGVKTVLL FYMLLGFVSC 

PEDV LSCAMTLFAS VTGNWFVGAV CYKVAVYMAL RFP TFVA IFGDIKSVMF CYLVLGYFTC 

TGEV VNMLTMIVSL TTKDWMWIA SYRIAYYIW CVMP-SAFVS DFGFMKCISI VYMACGYLFC 

OV43 LLMLASLFGT YTWT TVL SMAVAKVIAK WVAVNVLYFT DIPQIKIVLL CYLFIGYIIS 

BoCoV LLMLASLFGT YTWT TAL SMAAAKVIAK WVAVNVLYFT DIPQIKIVLV CYLFIGYIIS 

MHV LLFLTSLFGT YTWT TML SLATAKVIAK WLAVNVLYFT DVPQVKLVLL SYLCIGYVCC 

AIBV ELVHTTVLAN VSSNSLIGLF VFKCAKWMLY YCN AT YLNNYVLMAV MVNCIGWLCT 

SARS COV SMWALVISVT SNYSGVVTTI MFLARAIVFV CVEYYPLLFI TGNTLQCIML VYCFLGYCCC 



I 1 I I I I I I I I 1 I 

3965 3975 3985 3995 4005 4015 

EMCR TYWGILYWFN RFFKCTMGVY DFKVSAAEFK YMVANGLHAP YGPFDALWLS FKLLGIGGDR 

22 9E MYYGLLYWIN RFCKCTLGVY DFCVSPAEFK YMVANGLNAP NGPFDALFLS FKLMGIGGPR 

PEDV CFYGILYWFN RFFKVSVGVY DYTVSAAEFK YMVANGLRAP TGTLDSLLLS AKLIGIGGER 

TGEV CYYGILYWVN RFTCMTCGVY QFTVSAAELK YMTANNLSAP KNAYDAMILS AKLIGVGGKR 

OV43 CYWGLFSLMN SLFRMPLGVY NYKISVQELR YMNANGLRPP KNSFEALMLN FKLLGIGGVP 

BoCoV CYWGLFSLMN SLFRMPLGVY NYKISVQELR YMNANGLRPP KNSFEALMLN FKLLGIGGVP 

MHV CYWGVLSLLN SIFRMPLGVY NYKISVQELR YMNANGLRPP RNSFEALVLN FKLLGIGGVP 

AIBV CYFGLYWWVN KVFGLTLGKY NFKVSVDQYR YMCLHKINPP KTVWEVFSTN ILIQGIGGDR 

SARS CoV CYFGLFCLLN RYFRLTLGVY DYLVSTQEFR YMNSQGLLPP KSSIDAFKLN IKLLGIGGKP 



I I I 1 1 I I I I I f I 

4025 4035 4045 4055 4065 4075 

EMCR CIKISTVQSK LTDLKCTNW LLGCLSSMNI AANSSEWAYC VDLHNKINLC DDPEKAQGML 

229E TIKVSTVQSK LTDLKCTNW LMGILSNMNI ASNSKEWAYC VEMHNKINLC DDPETAQELL 

PEDV NIKISSVQSK LTDIKCSNVV LLGCLSSMNV SANSTEWAYC VDLHNKINLC NDPEKAQEML 

TGEV NIKISTVQSK LTEMKCTNW LLGLLSKMHV ESNSKEWNYC VGLHNEINLC DDPEIVLEKL 

OV43 IIEVSQFQSK LTDVKCANW LLNCLQHLHV ASNSKLHHYC STLHNEILAT SDLSVAFEKL 

BoCoV IIEVSQFQSK LTDVKCANGG LLNCLQHLHV ASNSKLWQYC STLHNEILAT SDLGVAFEKL 

MHV VIEVSQIQSR LTDVKCVNW LLNCLQHLHI ASSSKLWQYC STLHNEILAT SDLSVAFDKL 

AIBV VLPIATVQAK LSDVKCTTW LMQLLTKLNV EANSKMHVYL VELHNKILAS DDVGECMDNL 

SARS COV CIKVATVQSK MSDVKCTSW LLSVLQQLRV ESSSKLWAQC VQLHNDILLA KDTTEAFEKM 

I I I I I I I I I I I I 

4085 4095 4105 4115 4125 4135 

EMCR LALLAFFLSK HSDFG LDGLIDSYF DNSSTLQSVA SSFVSMPSYI AYENARQAYE 

229E LALLAFFLSK HSDFG LGDLVDSYF ENDSILQSVA SSFVGMPSFV AYETARQEYE 

PEDV LALLAFFLSK NSAFG LDDLLESYF NDNSMLQSVA STYVGLPSYV lYENARQQYE 

TGEV LALIAFFLSK HNTCD LSELIESYF ENTTILQSVA SAYAALPSWI ALEKARADLE 

OV43 AQLLIVLFAN PAAVDSKCLT SIEEVCDDYA KDNTVLQALQ SEFVNMASFV EYEVAKKNLD 

BoCoV AQLLIVLFAN PAAVDSKCLT SIEEVCDDYA KDNTVLQALQ SEFVNMASFV EYEVAKKNLD 

MHV AQLLVVLFAN PAAVDSKCLA SIEEVSDDYV RDSTVLQALQ SEFVNMASFV EYELAKKNLD 

AIBV LGMLITLFCI DSTID -LSEYCDDIL KRSTVLQSVT QEFSHIPSYA EYERAKNLYE 

SARS CoV VSLLSVLLSM QGAVD -INRLCEEML DNRATLQAIA SEFSSLPSYA AYATAQEAYE 
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I I I I I I I I I I I I 

4145 4155 4165 4175 4185 4195 

EMCR DAIANGSS SQLIKQLK RAMNIAKSEF DHEISVQKKI NRMAEQAATQ MYKEARSVNR 

229E NAVANGSS PQIIKQLK KAMNVAKAEF DRESSVQKKI NRMAEQAAAA MYKEARAVNR 

PEDV DAVMNGSP — — PQLVKQLR HAMMVAKSEF OREASTQRKL DRMAEQAAAQ MYKEARAVNR 

TGEV EAKKNDVS PQILKQLT KAFNIAKSDF EREASVQKKL DKMAEQAAAS MYKEARAVDR 

OV43 EARFSGSAN QQQLKQLE KACNIAKSAY ERDRAVAKKL ERMADLALTN MYKEARINDK 

BOCOV EACSSGSAN QQQLKQLE KACNIAKSAY ERDRAVARKL ERMADLALTN MYKEARINDK 

MHV BAKASGSAN- — QQQIKQLE KACNIAKSAY ERDRAVARKL ERMADLALTN MYKEARINDK 

AIBV KVLVDSKNGG VTQQELAAYR KAANIAKSVF DRDLAVQKKL DSMAERAMTT MYKEARVTDR 

SARS CoV QAVANGDS — — EVVLKKLK KSLNVAKSEF DRDAAMQRKL EKMADQAMTQ MYKQARSEDK 

I I I I \ I 1 I I I I I 

4205 4215 4225 4235 4245 4255 

EMCR KSKVISAMHS LLFGMLRRLD MSSVETVLNL ARDGWPLSV IPATSASKLT IVSPDLESYS 

229E KSKVVSAMHS LLFGMLRRLD MSSVDTILNM ARNGVVPLSV IPATSAARLV VWPDHDSFV 

PEDV KSKVVSAMHS LLFGMLRRLD MSSVDTILNL AKDGVVPLSV IPAVSATKLN IVTSDIDSYN 

TGEV KSKIVSAMHS LLFGMLKKLD MSSVNTIIDQ ARNGVLPLSI IPAASATRLV VITPSLEVFS 

OV43 KSKVVSALQT MLFSMVRKLD NQALNSILDN AVKGCVPLNA IPSLAANTLN IIVPDKSVYD 

BoCoV KSKVVSALQT MLFSMVRKLD NQALNSILDN AVKGCVPLNA IPSLAANTLT IIVPDKSVYD 

MHV KSKVVSALQT MLFSMIRKLD NQALNSILDN AVKGCVPLNA IPSLTSNTLT IIVPDKQVFD 

AIBV RAKLVSSLHA LLFSMLKKID SEKLNVLFDQ ASSGVVPLAT VPIVCSNKLT LVIPDPETWV 

SARS CoV RAKVTSAMQT MLFTMLRKLD NDALNNIINN ARDGCVPLNI IPLTTAAKLM WVPDYGTYK 

I I I I I I I. ...I I I I I 

4265 4275 4285 4295 4305 4315 

EMCR KIVCDGSVHY AGWWTLNDV KDNDGRPVHV KEITRENVET LT WPL ILNCERWK- 

229E KMMVDGFVHY AGVVWTLQEV KDNDGKNVHL KDVTKENQEI LV WPL ILTCERVVK- 

PEDV RIQREGCVHY AGTIWNIIDI KDNDGKWHV KEVTAQNAES LS WPL VLGCERIVK- 

TGEV KIRQENNVHY AGAIWTIVEV KDANGSHVHL KEVTAANELN LT WPL SITCERTTK- 

OV43 QVVDNVYVTY AGNVWQIQTI QDSDGTNKQL NEISDDCN — WPL VIIANRYNE- 

BoCoV QVVDNVYVTY AGNVWQIQTI QDSDGTNKQL HEISDDCN WPL VIIANRHNE- 

MHV QVVDNVYVTY AGNVWHIQSI QDADGAVKQL NEIDVNIT WPL VIAANRHNE- 

AIBV KCVEGVHVTY STWWNIDTV IDADGTELHP TSTGSGLTYC ISGANIAWPL KVNLTRNGHN 
SARS CoV NTCDGNTFTY ASALWEIQQV VDADSKIVQL SEINMDNSPN LA WPL IVTALRAN— 

I I 1 1 I I I I I I I I 

4325 4335 4345 4355 4365 4375 

EMCR LQ-NNE IMPGKLKQKP MKAEG — DGG VLGDGNALYN TEGGKTFMYA YISNKADLKF 

229E LQ-NNE IMPGKMKVKA TKGEG — DGG ITSEGNALYN NEGGRAFMYA YVTTKPGMKY 

PEDV LQ-NNE IIPGKLKQRS IKAEG — DG- IVGEGKALYN NEGGRTFMYA FISDKPDLRV 

TGEV LQ-NNE IMPGKLKERA VRASATLDGE AFGSGKALMA SESGKSFHYA FIASDNNLKY 

OV43 VSATVLQNNE LMPAKLKIQV VNSGPDQTCN TPT— QCYYN NSNNGKIVYA ILSDVDGLKY 

BoCoV VSATVLQNNE LMPAKLKTQV VNSGPDQTCN TPT — QCYYN NSYNGKIVYA ILSDVDGLKY 

MHV VSSVVLQNNE LMPQKLRTQV VNSGSDMNCN TPT — QCYYN TTGMGKIVYA ILSDCDGLKY 

AIBV KVDVVLQNNE LMPHGVKTKA CVAGVDQAHC SVES-KCYYT NISGNSWAA ITSSNPNLKV 

SARS CoV -SAVKLQNNE LSPVALRQMS CAAGTTQTAC TDDNALAYYN NSKGGRFVLA LLSDHQDLKW 

....I I ....I I I I I I I I I I 

4385 4395 4405 4415 4425 4435 

EMCR VKWEYEGG — CNTIELDSPC RFMVETPNGP QVKYLYFVKN LNTLRRGAVL GFIGATIRLQ 

229E VKWEHDSG — WTVELEPPC RFVIDTPTGP QIKYLYFVKN LNNLRRGAVL GYIGATVRLQ 

PEDV VKWEFDGG — CNTIELEPPR KFLVDSPNGA QIKYLYFVRN LNTLRRGAVL GYIGATVRLQ 

TGEV VKWESNND — IIPIELEAPL RFYVDGANGP EVKYLYFVKN LNTLRRGAVL GYIGATVRLQ 

OV43 TKILKDDGN- FVVLELDPPC KFTVQDAKGL KIKYLYFVKG CNTLARGWW GTISSTVRLQ 

BoCoV TKILKDDGN- FVVLELDPPC KFTVQDVKGL KIKYLYFVKG CNTLARGWW GTISSTVRLQ 

MHV TKIVKEDGN- CVVLELDPPC KFSVQDVKGL KIKYLYFVKG CNTLARGWW GTLSSTVRLQ 

AIBV ASFLNEAGN- QIYVDLDPPC KFGMKVGVKV EWYLYFIKN TRSIVRGMVL GAISNVVVLQ 

SARS CoV ARFPKSDGTG TIYTELEPPC RFVTDTPKGP KVKYLYFIKG LNNLNRGMVL GSLAATVRLQ 

I I I ! ....I I I.. ..I I I I I 

4445 4455 4465 4475 4485 4495 

EMCR AG-KQTELAV NSGLLTACAF SVDPATTYLE AVKHGAKPVS NCIKMLSNGA GNGQAITTSV 

229E AG-KQTEFVS NSHLLTHCSF AVDPAAAYLD AVKQGAKPVG NCVKMLTNGS GSGQAITCTI 

PEDV AG-KQTEQAI NSSLLTLCAF AVDPAKTYID AVKSGHKPVG NCVKMLANGS GNGQAVTNGV 

TGEV AG-KPTEHPS NSSLLTLCAF SPDPAKAYVD AVKRGMQPVN NCVKMLSNGA GNGMAVTNGV 

OV43 AG-TATEYAS NSSILSLCAF SVDPKKTYLD FIQQGGTPIA NCVKMLCDHA GTGMAITVKP 

BoCoV AG-TATEYAS NSSILSLCAF SVDPKKTYLD FIQQGGTPIA NCVKMLCDHA GTGMAITVKP 

MHV AG-TATEYAS NSAIRSLCAF SVDPKKTYLD YIQQGGAPVT NCVKMLCDHA GTGMAITIKP 

AIBV SKGHETEEVD AVGILSLCSF AVDPADTYCK YVAAGNQPLG NCVKMLTVHN GSGFAITSKP 

SARS CoV AG-NATEVPA NSTVLSFCAF AVDPAKAYKD YLASGGQPIT NCVKMLCTHT GTGQAITVTP 

1 I 1 I I I I I I I 1 I 

4505 4515 4525 4535 4545 4555 

EMCR DANTNQDSYG GASICLYCRA HVPHP S MDGYCKFKGK CVQVP-IGCL DPIRFCLENN 

229E DSNTTQDTYG GASVCIYCRA HVAHP T MDGFCQYKGK WVQVP-IGTN DPIRFCLENT 

PEDV EASTNQDSYG GASVCLYCRA HVEHP S MDGFCRLKGK YVQVP-LGTV DPIRFVLEND 

TGEV EANTQQDSYG GASVCIYCRC HVEHP A IDGLCRYKGK FVQIP-TGTQ DPIRFCIENE 

OV43 DATTSQDSYG GASVCIYCRA RVEHP D VDGLCKLRGK FVQVP-VGIK DPVSYVLTHD 

BoCoV DATTSQDSYG GASVCIYCRA RVEHP D VDGLCKLRGK FVQVP-VGIK DPVSYVLTHD 

MHV EATTNQDSYG GASVCIYCRS RVEHP D VDGLCKLRGK FVQVP-LGIK DPVSYVLTHD 

AIBV SPTPDQDSYG GASVCLYCRA HIAHPGSVGN LDGRCQFKGS FVQIP-TTEK DPVGFCLRNK 
SARS CoV EANMDQESFG GASCCLYCRC HIDHP N PKGFCDLKGK YVQIPTTCAN DPVGFTLRNT 
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( I ....I I I I I I I I I I 

4565 4575 4585 4595 4605 4615 

EMCR VCNVCGCWLG HGCACDRTTI QSVDIS YLNRARGSSA -ARLEPCN-G TDIDKCVRAF 

229E VCKVCGCWLN HGCTCDRTAI QSFDNS YLNRVRGSSA -ARLEPCN-G TDIDYCVRAF 

PEDV VCKVCGCWLS NGCTCDRSIM QSTDYG LFKRVRGSSA -ARLEPCN-G TDTQHVYRAF 

TGEV VCWCGCWLN NGCMCDRTSM QSFTVDQSY- LFKRVRGSSA -ARLEPCN-G TDPDHVSRAF 

OV43 VCRVCGFWRD GSCSCVSTDT TVQSKDTN — FFKRVRGTSV DARLVPCASG LSTDVQLRAF 

BoCoV VCQVCGFWRD GSCSCVSTDT TVQSKDT FFKRVRGTSV DARLVPCASG LSTDVQLRAF 

MHV VCQVCGFWRD GMFLCR-HRL PVSVKRHE — LFKRVRGTSV NARLVPCASG LDTDVQLRAF 

AIBV VCTVCQCWIG YGCQCDSLRQ PKSSVQS VAGASD FDKNYLNG — YGVAVRLGMF 

SARS CoV VCTVCGMWKG YGCSCDQLRE PLMQSADAST FLNRVCGVSA -ARLTPCGTG TSTDVVYRAF 

I I ....I I I I I I I I I I 

4625 4635 4645 4655 4665 4675 

EMCR DIYNKNVSFL GKCLKMNCVR FKNADLK DGYFVIKRC TKSVMEHEQS MYNLLNFSGA 

229E DVYNKDASFI GKNLKSNCVR FKNVDKD -DAFYIVKRC IKSVMDHEQS MYNLLKGCNA 

PEDV DIYNKDVACL GKFLKVNCVR LKNLDKH -DAFYWKRC TKSAMEHEQS lYSRLEKCGA 

TGEV DIYNKDVACI GKFLKTNCSR FRNLDKH -DAYYIVKRC TKTVMDHEQV CYNDLKDSGA 

OV4 3 DIYNASVAGI GLHLKVNCCR FQRVDENGDK LDQFFWKRT DLTIYNREMK CYERVKDCKF 

BoCoV DICNASVAGI GLHLKVNCCR FQRVDENGDK LDQFFVVKRT DLTIYNREME CYERVKDCKF 

MHV DICNANRAGI GLYYKVNCCR FQRADEDGNT LDKFFVIKRT NLEVYNKEKE CYELTKECGV 

AIBV QNLKRNCARF QELRDTEDGN LEYLDS YFVVKQT TPSNYEHEKS CYEDLKS-EV 

SARS CoV DIYNEKVAGF AKFLKTNCCR FQEKDEEGNL LDSYFWKRH TMSNYQHEET lYNLVKDCPA 

I I I I I 1 I I I 1 I I 

4685 4695 4705 4715 4725 4735 

EMCR LAEHDFFTWK DGRVIYGNVS RHNLTKYTMM DLVYAMRNFD EQNCDVLKEV LVLTGCCDNS 

229E VAKHDFFTWH EGRTIYGNVS RQDLTKYTMM DLCFALRNFD EKDCEVFKEI LVLTGCCSTD 

PEDV lAEHDFFTWK DGRAIYGNVC RKDLTEYTMM DLCYALRNFD ENNCDVLKSI LIKVGACEES 

TGEV VAEHDFFTYK EGRCEFGNVA RRNLTKYTMM DLCYAIRNFD EKNCEVLKEI LVTVGACTEE 

OV4 3 VAEHDFFTFD VEGSRVPHIV RKDLTKYTML DLCYALRHFD RNDCMLLCDZ LSIYAGCEQS 

BoCoV VAEHDFFTFD VEGSRVPHIV RKDLTKYTML DLCYALRHFD RNDCMLLCDI LSIYAGCEQS 

MHV VAEHEFFTFD VEGSRVPHIV RKDLSKYTML DLCYALRHFD RNDCSTLKEI LLTYAECDES 

AIBV TADHDFFVFN KN lYNIS RQRLTKYTMM DFCYALRHFD PKDCBVLKEI LVTYGCIEDY 

SARS CoV VAVHDFFKFR VDGDMVPHIS RQRLTKYTMA DLVYALRHFD EGNCDTLKEI LVTYNCCDDD 

I I I I I I I I I I I I 

4745 4755 4765 4775 4785 4795 

EMCR YFDSKG WYDPVENEDI HRVYASLGKI VARAMLKCVA LCDAMVAKGV VGVLTLDNQD 

229E YFEMKN WFDPIENEDI HRVYAALGKV VANAMLKCVA FCDEMVLKGV VGVLTLDNQD 

PEDV YFNNKV WFDPVENEDI HRVYALLGTI VARAMLKCVK FCDAMVEQGI VGWTLDNQD 

TGEV FFENKD WFDPVENEAI HEVYAKLGPI VANAMLKCVA FCDAIVEKGY IGVITLDNQD 

OV43 YFTKKD WYDFVENPDI INVYKKLGPI FNRALVSATE FADKLVEVGL VGVLTLDNQD 

BoCoV YFTKKD WYDFVENPDI INVYKKLGPI FNRALVSATE FADKLVEVGL VGILTLDNQD 

MHV YFQKKD WYDFVENSDI INVYKKLGPI FNRALLNTAK FADTLVEAGL VGVLTLDNQD 

AIBV HPKWFEENKD WYDPIENSKY YVMLAKMGPI VRRALLNAIE FGNLMVEKGY VGVITLDNQD 

SARS COV YFNKKD WYDFVENPDI LRVYANLGER VRQSLLKTVQ FCDAMRDAGI VGVLTLDNQD 

I I I 1 I I I I I I I I 

4805 4815 4825 4835 4845 4855 

EMCR LNGNFYDFGD FWSLPNMGV PCCTSYYSYM MPIMGLTNCL ASECFVKSDI FGSDFKTFDL 

229E LNGNFYDFGD FVLCPPGMGI PYCTSYYSYM MPVMGMTNCL ASECFMKSDI FGQDFKTFDL 

PEDV LNGDFYDFGD FTCSIKGMGV PICTSYYSYM MPVMGMTNCL ASECFVKSDI FGEDFKSYDL 

TGEV LNGNFYDFGD FVKTAPGFGC ACVTSYYSYM MPLMGMTSCL ESENFVKSDI YGSDYKQYDL 

OV43 LNGKWYDFGD YVIAAPGCGV AIADSYYSYI MPMLTMCHAL DCELYVNN — AYRLFDL 

BoCoV LNGKWYDFGD YVIAAPGCGV AIADSYYSYM MPMLTMCHAL DCELYVNN — AYRLFDL 

MHV LYGQWYDFGD FVKTVPGCGV AVADSYYSYM MPMLTMCHAL DSELFING TYREFDL 

AIBV LNGKFYDFGD FQKTAPGAGV PVFDTYYSYM MPIIAMTDAL APERYFEYDV H-KGYKSYDL 

SARS CoV LNGNWYDFGD FVQVAPGCGV PIVD5YYSLL MPILTLTRAL AAESHMDADL A-KPLIKHDL 

I I I I I I I I I I I I 

4865 4875 4885 4895 4905 4915 

EMCR LKYDFTEHKE NLFNKYFKHW SFDYHPNCSD CYDDMCVIHC ANFNTLFATT IPGTAFGPLC 

229E LKYDFTEHKE VLFNKYFKYW GQDYHPDCVD CHDEMCILHC SNFNTLFATT IPNTAFGPLC 

PEDV LEYDFTEHKT ALFNKYFKYW GLQYHPNCVD CSDEQCIVHC ANFNTLFSTT IPITAFGPLC 

TGEV LAYDFTEHKE YLFQKYFKYW DRTYHPNCSD CTSDECIIHC ANFNTLFSMT IPMTAFGPLV 

OV43 VQYDFTDYKL ELFNKYFKHW SMPYHPNTVD CQDDRCIIHC ANFNILFSMV LPNTCFGPLV 

BoCoV VQYDFTDYKL ELFNKYFKHW SMPYHPNTVD CQDDRCIIHC ANFNILFSMV LPNTCFGPLV 

MHV VQYDFTDFKL ELFNKYFKYW SMTYHPNTCE CEDDRCIIHC ANFNILFSMV LPKTCFGPLV 

AIBV LKYDYTEEKQ ELFQKYFKYW DQEYHPNCRD CSDDRCLIHC ANFNILFSTL IPQTSFGNLC 

SARS CoV LKYDFTEERL CLFDRYFKYW DQTYHPNCIN CLDDRCILHC ANFNVLFSTV FPPTSFGPLV 

I I I I I I I I I I I I 

4925 4935 4945 4955 4965 4975 

EMCR RKVFIDGVPL VTTAGYHFKQ LGLVWNKDVN THSVRLTITE LLQFVTDPSL IIASSPALVD 

229E RKVFIDGVPV VATAGYHFKQ LGLVWNKDVN THSTRLTITE LLQFVTDPTL IVASSPALVD 

PEDV RKCWIDGVPL VTTAGYHFKQ LGIVWNNDLN LHSSRLSINE LLQFCSDPAL LIASSPALVD 

TGEV RKVHIDGVPV VVTAGYHFKQ LGIVWNLDVK LDTMKLSMTD LLRFVTDPTL LVASSPALLD 

OV43 RQIFVDGVPF VVSIGYHYKE LGIVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALYD 

BoCoV RQIFVDGVPF VVSIGYHYKE LGIVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALYD 

MHV RQIFVDGVPF VVSIGYHYKE LGVVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALLD 

AIBV RKVFVDGVPF lATCGYHSKE LGVIMNQDNT MSFSKMGLSQ LMQFVGDPAL LVGTSNNLVD 

SARS CoV RKIFVDGVPF WSTGYHFRE LGVVHNQDVN LHSSRLSFKE LLVYAADPAM HAASGNLLLD 
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I I I I I.. ..I 1 1 I I I I 

4985 4995 5005 5015 5025 5035 

EMCR QRTICFSVAA LSTGLTNQVV KPGHFNEEFY NFLRLRGFFD EGSCLTLKHF FFAQNGDAAV 

229E KRTVCFSVAA LSTGLTSQTV KPGHFNKEFY DFLRSQGFFD EGSELTLKHF FFTQKGDAAI 

PEDV QRTVCFSVAA LGTGMTNQTV KPGHFNKEFY DFLLEQGFFS EGSELTLKHF FFAQKVDAAV 

TGEV QRTVCFSIAA LSTGITYQTV KPGHFNKDFY DFITERGFFE EGSELTLKHF FFAQGGEAAM 

OV43 LRTCCFSVAA ITSGVKFQTV KPGNFNQDFY DFVLSKGLLK EGSSVDLKHF FFTQDGNAAI 

BoCoV LRTCCFSVAA ITSGVKFQTV KPGNFNQDFY DFILSKGLLK EGSSVDLKHF FFTQDGNAAI 

MHV LRTCCFSVAA ITSGVKFQTV KPGNFNQDFY EFILSKGLLK EGSSVDLKHF FFTQDGNAAI 

AIBV LRTSCFSVCA LTSGITHQTV KPGHFNKDFY DFAEKAGMFK EGSSIPLKHF FYPQTGNAAI 

SARS COV KRTTCFSVAA LTNNVAFQTV KPGNFNKDFY DFAVSKGFFK EGSSVELKHF FFAQDGNAAI 

I I I I I I I I I I I I 

5045 5055 5065 5075 5085 5095 

EMCR KDFDFYRYNK PTILDICQAR VTYKIVSRYF DIYEGGCIKA CEVVVTNLNK SAGWPLNKFG 

229E KDFDYYRYNR PTMLDIGQAR VAYQVAARYF DCYEGGCITS REWVTNLNK SAGWPLNKFG 

PEDV KDFDYYRYNR PTVLDICQAR VVYQIVQRYF DIYEGGCITA KEVWTNLNK SAGYPLNKFG 

TGEV TDFNYYRYNR VTVLDICQAQ FVYKIVGKYF ECYDGGCINA REVWTNYDK SAGYPLNKFG 

OV43 TDYNYYKYNL PTMVDIKQLL FVLEVVYKYF EIYDGGCIPA SQVIVNNYDK SAGYPFNKFG 

BoCoV TDYNYYKYNL PTMVDIKQLL FVLEVVYKYF EIYDGGCIPA AQVIVNNYDK SAGYPFNKFG 

MHV TDYNYYKYNL PTMVDIKQLL FVLEWNKYF EIYDGGCIPA TQVIVNNYDK SAGYPFNKFG 

AIBV NDYDYYRYNR PTMFDICQLL FCLEVTSKYF ECYEGGCIPA SQVWNNLDK SAGYPFNKFG 

SARS CoV SDYDYYRYNL PTMCDIRQLL FWEWDKYF DCYDGGCINA NQVIVNNLDK SAGFPFNKWG 

f 1 I I I I I I I I I I 

5105 5115 5125 5135 5145 5155 

EMCR KASLYYESIS YEEQDALFAL TKRNVLPTMT QLNLKYAISG KERARTVGGV SLLSTMTTRQ 

229E KAGLYYESIS YEEQDAIFSL TKRNILPTMT QLNLKYAISG KERARTVGGV SLLATMTTRQ 

PEDV KAGLYYESLS YEEQDELYAY TKRNILPTMT QLNLKYAISG KERARTVGGV SLLSTMTTRQ 

TGEV KARLYYETLS YEEQDALFAL TKRNVLPTMT QMNLKYAISG KARARTVGGV SLLSTMTTRQ 

OV43 KARLYYEALS FEEQDEIYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM 

BoCoV KARLYYEALS FEEQDEIYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM 

MHV KARLYYEALS FEEQDEVYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM 

AIBV KARLYYEMS- LEEQDQLFEI TKKNVLPTIT QMNLKYAISA KNRARTVAGV SILSTMTNRQ 

SARS CoV KARLYYDSMS YEDQDALFAY TKRNVIPTIT QMNLKYAISA KNRARTVAGV SICSTMTNRQ 

I I I I I I I I I I I I 

5165 5175 5185 5195 5205 5215 

EMCR YHQKHLKSIV NTRNATWIG TTKFYGGWNN HLRTLIDGVE NPHLMGWDYP KCDRALPNMI 

22 9E FHQKCLKSIV ATRNATWIG TTKFYGGWDN MLKNLMAOVD DPKLMGWDYP KCDRAMPSMI 

PEDV YHQKHLKSIV NTRGASVVIG TTKFYGGWDN MLKNLIDGVE NPCLMGWDYP KCDRALPNMI 

TGEV YHQKHLKSIA ATRNATWIG STKFYGGWDN MLKNLMRDVD NGCLMGWDYP KCDRALPNMI 

OV43 FHQKCLKSIA ATRGVPVVIG TTKFYGGWDD MLRRLIKDVD NPVLMGWDYP KCDRAMPNLL 

BoCoV FHQKCLKSIA ATRGVPVVIG TTKFYGGWDD MLRRLIKDVD NPVLMGWDYP KCDRAMPNIL 

MHV FHQKCLKSIA ATRGVPVVIG TTKFYGGWDD MLRRLIKDVD SPVLMGWDYP KCDRAMPNIL 

AIBV FHQKILKSIV NTRNASVVIG TTKFYGGWDN MLRNLIQGVE DPILMGWDYP KCDRAMPNLL 

SARS CoV FHQKLLKSIA ATRGATVVIG TSKFYGGWHN MLKTVYSDVE TPHLMGWDYP KCDRAMPNML 

I I I I I I I I I I I I 

5225 5235 5245 5255 5265 5275 

EMCR RMISAMVLGS KHVNCCTVTD RFYRLGNELA QVLTEVVYSN GGFYFKPGGT TSGDASTAYA 

22 9E RMLSAMILGS KHVTCCTASD KFYRLSNELA QVLTEVVYSN GGFYFKPGGT TSGDATTAYA 

PEDV ElMISAMILGS KHTTCCSSTD RFFRLCNELA QVLTEVVYSN GGFYLKPGGT TSGDATTAYA 

TGEV RMASAMILGS KHVGCCTHND RFYRLSNELA QVLTEVVHCT GGFYFKPGGT TSGDGTTAYA 

OV43 RIVSSLVLAR KHETCCSQSD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA 

BoCoV RIVSSLVLAR KHEACCSQSD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA 

MHV RIISSLVLAR KHDSCCSHTD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA 

AIBV RIAASLVLAR KHTNCCSWSE RIYRLYNBCA QVLSETVLAT GGIYVKPGGT SSGDATTAYA 

SARS COV RIMASLVLAR KHNTCCNLSH RFYRLANECA QVLSEHVMCG GSLYVKPGGT SSGDATTAYA 

t I I I I I I I I I I 1 

5285 5295 5305 5315 5325 5335 

EMCR NSIFNIFQAV SSNINRLLSV PSDSCNNVNV RDLQRRLYDN CYRLTSVEES FIDDYYGYLR 

22 9E NSVFNIFQAV SSNINCVLSV NSSNCNNFNV KKLQRQLYDN CYRNSNVDES FVDDFYGYLQ 

PEDV NSVFNIFQAV SANVNKLLSV DSNVCHNLEV KQLQRKLYEC CYRSTIVDDQ FWEYYGYLR 

TGEV NSAFNIFQAV SANVNKLLGV DSNACNNVTV KSIQRKIYON CYRSSSIDEE FVVEYFSYLR 

OV43 NSVFNICQAV SANVCALMSC NGNKIEDLSI RALQKRLYSH VYRSDKVDST FVTEYYEFLN 

BoCoV NSVFNICQAV SANVCALMSC NGNKIEDLSI RALQKRLYSH VYRSDMVDST FVTEYYEFLN 

MHV NSVFNICQAV SANVCSLMAC NGHKIEDLSI RELQKRLYSN VYRADHVDPA FVNEYYEFLN 

AIBV NSVFNIIQAT SANVARLLSV ITRDIVYDNI KSLQYELYQQ VYRRVNFDPA FVEKFYSYLC 

SARS CoV NSVFNICQAV TANVNALLST DGNKIADKYV RNLQHRLYEC LYRNRDVDHE FVDEFYAYLR 

I I I I I 1 1 I I I I I 

5345 5355 5365 5375 5385 5395 

EMCR KHFSMMILSD DGVVCYNKDY AELGYIADIS AFKATLYYQN NVFMSTSKCW VEEDLTKGPH 

229E KHFSMMILSD DSVVCYNKTY AGLGYIADIS AFKATLYYQN GVFHSTAKCH TEEDLSIGPH 

PEDV KHFSMMILSD DGVVCYNNDY ASLGYVADLN AFKAVLYYQN NVFMSASKCW lEPDINKGPH 

TGEV KHFSMMILSD DGVVCYNKDY ADLGYVADIN AFBtATLYYQN NVFMSTSKCW VEPDLSVGPH 

OV43 KHFSMMILSD DGVVCYNSDY ASKGYIANIS AFQQVLYYQN NVFMSESKCW VEHDINNGPH 

BoCoV KHFSMMILSD DGVVCYNSDY ASKGYIANIS AFQQVLYYQN NVFMSESKCW VENDINNGPH 

MHV KHFSMMILSD DGVVCYNSEF ASKGYIANIS AFQQVLYYQN NVFMSEAKCW VETDIEKGPH 

AIBV KNFSLMILSD DGVVCYNNTL AKQGLVADIS GFREVLYYQN NVFMADSKCW VEPDLEKGPH 

SARS CoV KHFSMMILSD DAWCYNSNY AAQGLVASIK NFKAVLYYQN NVFMSEAKCW TETDLTKGPH 
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1 I 1 I I. ...I I I I I I I 

5405 5415 5425 5435 5445 5455 

EMCR EFCSQHTMQI VDKDGTYYLP YPDPSRILSA GVFVDDWKT DAWLLXRYV SLAIDAYPLS 

229E EFCSQHTMQI VDENGKYYLP YPDPSRIISA GVFVDDITKT DAVILLERYV SLAIDAYPLS 

PEDV EFCSQHTMQI VDKEGTYYLP YPDPSRILSA GVFVDDWKT DAWLLERYV SLAIDAYPLS 

TGEV EFCSQHTLQI VGPDGDYYLP YPDPSRILSA GVFVDDIVKT DNVIMLERYV SLAIDAYPLT 

OV43 EFCSQHTMLV KMDGDDVYLP YPNPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV 

BoCoV EFCSQHTMLV KMDGDDVYLP YPVPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV 

MHV EFCSQHTMLV KMDGDEVYLP YPDPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV 

AIBV EFCSQHTMLV EVDGEPKYLP YPDPSRILGA CVFVDDVDKT EPVAVMERYI ALAIDAYPLV 

SARS CoV EFCSQHTMLV KQGDDYVYLP YPDPSRILGA GCFVDDIVKT DGTLMIERFV SLAIDAYPLT 

I I I I 1 I I I I I I I 

5465 5475 5485 5495 5505 5515 

EMCR KHPNSEYRKV FYVLLDWVKH LNKNLNEGVL ESFSVTLLDN QEDKFWCEDF YASMYENSTI 

229E KHPKPEYRKV FYALLDWVKH LNKTLNEGVL ESFSVTLLDE HESKFWDESF YASMYEKSTV 

PEDV KHENPEYKKV FYVLLDWVKH LYKTLNAGVL ESFSVTLLED STAKFWDESF YANMYEKSAV 

TGEV KHPKPAYQKV FYTLLDWVKH LQKNLNAGVL DSFSVTMLEE GQDKFWSEEF YASLYEKSTV 

OV43 YHENEEYQKV FRVYLAYIKK LYNDLGNQIL DSYSVILSTC DGQKFTDESF YKNMYLRSAV 

BoCoV YHENEEYQKV FRVYLEYIKK LYNELGNQIL DSYSVILSTC DGQKFTDESF YKNMYLRSAV 

MHV YHENPEYQNV FRVYLEYIKK LYNDLGNQIL DSYSVILSTC DGQKFTDETF YKNMYLRSAV 

AIBV HHENEEYKKV FFVLLAYIRK LYQELSQNML MDYSFVMDID KGSKFWEQEF YENMYRAPTT 

SARS CoV KHPNQEYADV FHLYLQYIRK LHDELTGHML DMYSVMLTND NTSRYWEPEF YEAMYTPHTV 

I I I I I I |..,.| 1 I 1 I 

5525 5535 5545 5555 5565 5575 

EMCR LQAAGLCWC GSQTVLRCGD CLRKPMLCTK CAYDHVFGTD HKFILAITPY VCNASGCGVS 

229E LQAAGLCWC GSQTVLRCGD CLRRPMLCTK CAYDHVFGTD HKFILAITPY VCNTSGCNVN 

PEDV LQSAGLCWC GSQTVLRCGD CLRRPMLCTK CAYDHVIGTT HKFILAITPY VCCASDCGVN 

TGEV LQAAGMCWC GSQTVLRCGD CLRRPLLCTK CAYDHVMGTK HKFIMSITPY VCSFNGCNVN 

OV43 MQSVGACWC SSQTSLRCGS CIRKPLLCCK CCYDHVMATD HKYVLSVSPY VCNAPGCDVN 

BoCoV MQSVGACWC SSQTSLRCGS CIRKPLLCCK CCYDHVMATD HKYVLSVSPY VCNAPGCDVN 

MHV MQSVGACWC SSQTSLRCGS CIRKPLLCCK CAYDHVMSTD HKYVLSVSPY VCNSPGCDVN 

AIBV LQSCGVCWC NSQTILRCGN CIRKPFLCCK CCYDHVMHTD HKNVLSINPY ICSQLGCGEA 

SARS CoV LQAVGACVLC NSQTSLRCGA CIRRPFLCCK CCYDHVISTS HKLVLSVNPY VCNAPGCDVT 

I I I 1 I I I I 1 I I I 

5585 5595 5605 5615 5625 5635 

EMCR DVKKLYLGGL NYYCTNHKPQ LSFPLCSAGN IFGLYKNSAT 6SLDVEVFNR LATSDWTDVR 

229E DVTKLYLGGL NYYCVDHKPH LSFPLCSAGN VFGLYKSSAL GSMDIDVFNK LSTSDW5DIR 

PEDV DVTKLYLGGL SYWCHEHKPR LAFPLCSAGN VFGLYKNSAT GSPDVEDFNR lATSDWTDVS 

TGEV DVTKLFLGGL SYYCHNHKPQ LSFPLCANGN VFXjLYKSSAV GSEAVEDFNK LAVSDWTNVE 

GV43 DVTKLYLGGM SYYCEDHKPQ YSFKLVMNGL VFGLYKQSCT GSPYIDDFNR lASCKWTDVD 

BoCoV DVTKLYLGGM SYYCEDHKPQ YSFKLVMNGM VFGLYKQSCT GSPYIDDFNR lASCKWTDVD 

MHV DVTKLYLGGM SYYCEDHKPQ YSFKLVMNGM VFGLYKQSCT GSPYIEDFNK lASCKWTEVD 

AIBV DVTKLYLGGM SYFCGNHKPK LSIPLVSNGT VFGIYRANCA GSENVDDFNQ LATTNWSIVE 

SARS CoV DVTQLYLGGM SYYCKSHKPP ISFPLCANGQ VFGLYKNTCV GSDNVTDFNA lATCDWTNAG 

I I I I I I I 1 I I I I 

5645 5655 5665 5675 5685 5695 

EMCR DYKLANDVRD TLRLFAAETI KAKEESVKSS YAFATLKEW GPKELLLSWE SGKVKPPLNR 

229E DYKLANDAKE SLRLFAAETV KAKEESVKSS YAYATLKEIV GPKELLLLWE SGKAKPPLNR 

PEDV DYRLANDVKD SLRLFAAETI KAKEESVKSS YACATLHEW GPKELLLKWE VGRPKPPLNR 

TGEV DYKLANNVKE SLKIFAAETV KAKEESVKSE YAYAVLKEVI GPKEIVLQWE ASKTKPPLNR 

OV43 DYILANECTE RLKLFAAETQ KATEEAFKQS YASATIQEIV SERELILSWE IGKVKPPLNK 

BoCoV DYILANECTE RLKLFAAETQ KATEEAFKQS YASATIQEIV SERELILSWE IGKVKPPLNK 

MHV DYVLANECTE RLKLFAAETQ KATEESFKQC YASATIREIV SDRELILSWE IGKVRPPLNK 

AIBV PYILANRCSD SLRRFAAETV KATEELHKQQ FASAEVREVF SDRELILSWE PGKTRPPLNR 

SARS CoV DYILANTCTE RLKLFAAETL KATEETFKLS YGIATVREVL SDRELHLSWE VGKPRPPLNR 

1 1 I I I I 1 I I I I I 

5705 5715 5725 5735 5745 5755 

EMCR NSVFTCFQIS KDSKFQIGEF IFEKVEYGSD TVTYKSTVTT KLVPGMIFVL TSHNVQPLRA 

22 9E NSVFTCFQIT KDSKFQVGEF VFEKVDYGSD TVTYKSTATT KLVPGMLFIL TSHNVAPLRA 

PEDV NSVFTCYHIT KNTKFQIGEF VFEKAEYDND AVTYKTTATT KLVPGMVFVL TSHNVQPLRA 

TGEV NSVFTCFQIS KDTKIQLGEF VFEQSEYGSD SVYYKSTSTY KLTPGMIFVL TSHNVSPLKA 

OV43 NYVFTGYHFT KNGKTVLGEY VFDKSELT-N GVYYRATTTY KLSVGDVFVL TSHSVANLSA 

BoCoV NYVFTGYHFT KNGKTVLGEY VFDKSELT-N GVYYRATTTY KLSVGDVFVL TSHSVANLSA 

MHV NYVFTGYHFT SNGKTVLGBY VFDKSELT-N GVYYRATTTY KLSVGDVFIL TSHAVSSLSA 

AIBV NYVFTGYHFT RTSKVQLGDF TFEKGEGK-D WYYKATSTA KLSVGDIFVL TSHNVVSLVA 

SARS COV NYVFTGYRVT KNSKVQIGEY TFEKGDYG-D AWYRGTTTY KLNVGDYFVL TSHTVMPLSA 

I I I I I I 1 I I I I I 

5765 5775 5785 5795 5805 5815 

EMCR PTIANQEKYS SIYKLHPAFN VSDAYANLVP YYQLIGKQKI TTIQGPPGSG KSHCSIGLGL 

229E PTMANQEKYS TIYKLHPSFN VSDAYANLVP YYQLIGKQRI TTIQGPPGSG KSHCSIGIGV 

PEDV PTIANQERYS TIHKLHPAFN IPEAYSSLVP YYQLIGKQKI TTIQGPPGSG KSHCVIGLGL 

TGEV PILVNQEKYN TISKLYPVFN lAEAYNTLVP YYQMIGKQKF TTIQGPPGSG KSHCVIGLGL 

OV43 PTLVPQENYS SIR-FASVYS VLETFQNNVV NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV 

BoCoV PTLVPQENYS SIR-FASVYS VLETFQNNVV NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV 

MHV PTLVPQENYT SIR-FASVYS VPETFQNNVP NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV 

AIBV PTLCPQQTFS RFVNLRPNVM VPECFVNNIP LYHLVGKQKR TTVQGPPGSG KSHFAIGLAV 

SARS CoV PTLVPQEHYV RITGLYPTLN ISDEFSSNVA NYQKVGMQKY STLQGPPGTG KSHFAIGLAL 
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I 1 I I I I I I I I ! I 

5825 5835 5845 5855 5865 5875 

EMCR YYPGARIVFV ACAHAAVDSL CAKAMTVYSI DKCTRIIPAR ARVECYSGFK PNNTSAQYIF 

229E YYPGARIVFT ACSHAAVDSL CAKAVTAYSV DKCTRIIPAR ARVECYSGFK PNNNSAQYVF 

PEDV YYPGARIVFT ACSHAAVDSL CVKASTAYSN DKCSRIIPQR ARVECYDGFK SNNTSAQYLF 

TGEV YYPQARIVYT ACSHAAVDAL CEKAAKNFNV DRCSRIIPQR IRVDCYTGFK PNNTNAQYLF 

OV43 FYCTARVVYT AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVECYDKFK INDTTRKYVF 

BOCOV YYCTARWYT AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVECYDKFK INDTTRKYVF 

MHV YYCTARWYT AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVDCYDKFK VNDTTRKYVF 

AIBV YFSSARWFT ACSHAAVDAL CEKAFKFLKV DDCTRIVPQR TTVDCFSKFK ANDTGKKYIF 

SARS COV YYPSARIVYT ACSHAAVDAL CEKALKYLPI DKCSRIIPAR ARVECFDKFK VNSTLEQYVF 

I I I I I I I I I I I I 

5885 5895 5905 5915 5925 5935 

EMCR STVNALPECN ADIVWDEVS MCTNYDLSVI NQRLSYKHIV YVGDPQQLPA PRVMITKGVM 

229E STVNALPEVN ADIWVDEVS MCTNYDLSVI NQRISYKHIV YVGDPQQLPA PRVLISKGVM 

PEDV STVNALPECN ADIWVDEVS MCTNYDLSVI NQRISYRHVV YVGDPQQLPA PRVMISRGTL 

TGEV CTVNALPEAS CDIVVVDEVS MCTNYDLSVI NSRLSYKHIV YVGDPQQLPA PRTLINKGVL 

OV43 TTINALPEMV TDIVWDEVS MLTNYELSVI NARIRAKHYV YIGDPAQLPA PRVLLSKGTL 

BoCoV TTINALPEMV TDIVWDEVS MLTNYELSVI NARIRAKHYV YIGDPAQLPA PRVLLSKGTL 

MHV TTINALPELV TDIIVVDEVS MLTNYELSVI NSRVRAKHYV YIGDPAQLPA PRVLLNKGTL 

AIBV STINALPEVS CDILLVDEVS MLTNYELSFI NGKINYQYVV YVGDPAQLPA PRTLLN-GSL 

SARS CoV CTVNALPETT ADIWFDEIS MATNYDLSW NARLRAKHYV YIGDPAQLPA PRTLLTKGTL 

I I I I I 1 I I I I I I 

5945 5955 5965 5975 5985 5995 

EMCR EPVDYNWTQ RMCAIGPDVF LHKCYRCPAE IVNTVSELVY ENKFVPVKPA SKQCFKIFFK 

229E EPIDYNVVTQ RMCAIGPDVF LHKCYRCPAE IVNTVSELVY ENKFVPVKEA SKQCFKIFER 

PEDV EPKDYNWTQ RMCALKPDVF LHKCYRCPAE IVRTVSEMVY ENQFIPVHPD SKQCFKIFCK 

TGEV QPQDYNVVTK RMCTLGPDVF LHKCYRCPAE IVKTVSALVY ENKFVPVNPE SKQCFKMFVK 

OV43 EPKYFNTVTK LMCCLGPDIF LGTCYRCPKE IVDTVSALVY ENKLKAKNES SSLCFKVYYK 

BoCoV EPKYFNTVTK LMCCLGPDIF LGTCYRCPKE IVDTVSALVY ENKLKAKNES SSLCFKVYYK 

MHV EPRYFNSVTK LMCCLGPDIF LGTCYRCPKE IVDTVSALVY HNKLKAKNDN SSMCFKVYYK 

AIBV SPKDYNWTN LMVCVKPDIF LAKCYRCPKE IVDTVSTLVY DGKFIANNPE SRECFKVIVN 

SARS CoV EPEYFNSVCR LMKTIGPDMF LGTCRRCPAE IVDTVSALVY DNKLKAHKDK SAQCFKMFYK 

1 I 1 I 1 I I 1 1 I I I 

6005 6015 6025 6035 6045 6055 

EMCR GNVQVDN GSSINRKQLE IVKLFLVKNP SWSKAVFISP YNSQNYVASR FLGLQIQTVD 

22 9E GSVQVDN GSSINRRQLD WKRFIHKNS TWSKAVFISP YNSQNYVAAR LLGLQTQTVD 

PEDV GNVQVDN GSSINRRQLD WRMFLAKNP RWSKAVFISP YNSQNYVASR LLGLQIQTVD 

TGEV GQVQIES NSSINNKQLE WKAFLAHNP KWRKAVFISP YNSQNYVARR LLGLQTQTVD 

OV43 GVTTHES SSAVNMQQIY LINKFLKANP LWHKAVFISP YNSQNFAAKR VLGLQTQTVD 

BoCoV GVTTHES SSAVNMQQIY LINKFLKANP LWHKAVFISP YNSQNFAAKR VLGLQTQTVD 

MHV GQTTHES SSAVNMQQIY LISKFLKANP SWSNAVFISP YNSQNYVAKR VLGLQTQTVD 

AIBV NGNSDVGHES GSAYNTTQLE FVKDFVCRNK QWREAIFISP YNAMNQRAYR MLGLNVQTVD 
SARS CoV GVITHDV SSAINRPQIG WREFLTRNP AWRKAVFISP YNSQNAVASK ILGLPTQTVD 

I I I ( I I I I I I I I 

6065 6075 6085 6095 6105 6115 

EMCR SSQGSEYDYV lYAQTSDTAH ACNVNRFNVA ITRAKKGIFC VMCDKT-LFD SLKFFEIKHA 

229E SAQGSEYDYV IFAQTSDTAH ACNANRFNVA ITRAKKGIFC IMSDRT-LFD ALKFFEITMT 

PEDV SSQGSEYDYV lYAQTSDTAH ASNVNRFNVA ITRAKKGILC IMCDRS-LFD LLKFFELKLS 

TGEV SAQGSEYDYV lYTQTSDTQH ATNVNRFNVA ITRAKVGILC IMCDRT-MYE NLDFYELKDS 

OV43 SAQGSEYDYV lYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSNMQ-LFE ALQFTTLTLD 

BoCoV SAQGSEYDYV lYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSNMQ-LFE ALQFTTLTVD 

MHV SAQGSEYDFV lYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSSMQ-LFE SLNFSTLTLD 

AIBV SSQGSEYDYV IFCVTADSQH ALNINRFNVA LTRAKRGILV VMRQRDELYS ALKFTELDSE 

SARS CoV SSQGSEYDYV IFTQTTETAH SCNVNRFNVA ITRAKIGILC IMSDRD-LYD KLQFTSLEIP 

I I I I I I I ! 1 I I 1 

6125 6135 6145 6155 6165 6175 

EMCR DLHSS -QVCGLFKNC TRTPLNLPPT HAHTFLSLSD QFKTTGDLAV QIGS-N-NVC 

22 9E DLQSE -SSCGLFKDC ARNPIDLPPS HATTYLSLSD RFKTSGDLAV QIGN~N-NVC 

PEDV DLQAN -EGCGLFKDC SRGDDLLPPS HANTFMSLAD NFKTDQYLAV QIGV-N-GPI 

TGEV KI GLQAK PETCGLFKDC SKSEQYIPPA YATTYMSLSD NFKTSDGLAV NIG — T-KDV 

OV43 KVPQAVETKV QCSTNLFKDC SKSYSGYHPA HAPSFLAVDD KYKATGDLAV CLGIGD*SAV 

BoCoV KVPQAVETRV QCSTNLFKDC SKSYSGYHPA HAPSFLAVDD KYKATGDLAV CLGIGD-SAV 

MHV KIN NPRL QCTTNLFKDC SRSYAGYHPA HAPSFLAVDD KYKVGGDLAV CLNVAD-SAV 

AIBV T S— LQGTGLFKIC NKEFSGVHPA YAVTTKALAA TYKVNDELAA LVNVEAGSBI 

SARS CoV RRN-VATLQA ENVTGLFKDC SKIITGLHPT QAPTHLSVDI KFKTEG-LCV DIPGIP-KDM 

I I I I I 1 I I ! I I I 

6185 6195 6205 6215 6225 6235 

EMCR TYEHVISFMG FRFDISIPGS HSLFCTRDFA IRNVRGWLGM DVESAHVCGD NIGTNVPLQV 

229E TYEHVISYMG FRFDVSMPGS HSLFCTRDFA MRHVRGWLGM DVEGAHVTGD NVGTNVPLQV 

PEDV KYEHVISFMG FRFDINIPNH HTLFCTRDFA MRNVRGWLGF DVEGAHVVGS NVGTNVPLQL 

TGEV KYANVISYMG FRFEANIPGY HTLFCTRDFA MRNVRAWLGF DVEGAHVCGD NVGTNVPLQL 

OV4 3 TYSRLISLMG FKLDVTLDGY CKLFITKEEA VKRVRAWVGF DAEGAHATRD SIGTNFPLQL 

BoCoV TYSRLISLMG FKLDVTLDGY CKLFITKEEA VKRVRAWVGF DAEGAHATRD SIGTNFPLQL 

MHV TYSRLISLMG FKLDLTLDGY CKLFITRDEA IRRVRAWVGF DAEGAHATRD SIGTNFPLQL 

AIBV TYKHLISLLG FKMSVNVEGC HNMFITRDEA IRNVRGWVGF DVEATHACGT NIGTNLPFQV 

SARS CoV TYRRLISMMG FKMNYQVNGY PNMFITREEA IRHVRAWIGF DVEGCHATRD AVGTNLPLQL 
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I I I I I I I I I I I I 

6245 6255 6265 6275 6285 6295 

EMCR GFSNGVNFW QTEGCVSTNF GDVIKPVCAK SPPGBQFRHL VPFLRKGQPW LIVSUIRIVQM 

22 9E GFSNGVDFVA QPEGCVLTNT GSWKPVRAR APPGEQFTHI VPLLRKGQPW SVLRKRIVQM 

PEDV GFSNGVDFVV RPEGCWTES GDYIKPVRAR APPGEQFAHL LPLLKRGQPW DVVRKRIVQM 

TGEV GFSNGVDFVV QTEGCVITEK GNSIEVVKAR APPGEQFAHL IPLMRKGQPW HIVRRRIVQM 

OV43 GFSTGIDFW EATGLFADRD GYSFKKAVAK APPGEQFKHL IPLMTRGHRW DWRPRIVQM 

BOCOV GFSTGIDFW EATGLFADRD GYSFKKAVAK APPGEQFKHL IPLMTRGQRW DWRPRIVQM 

MHV GFSTGIDFW EATGMFAERD GYVFKKAVAR APPGEQFKHL VPLMSRGQKW DVVRIRIVQM 

AIBV GFSTGADFVV TPEGLVDTSI GNNFEPVNSK APPGEQFNHL RVLFKSAKPW HVIRPRIVQM 

SARS CoV GFSTGVNLVA VPTGYVDTEN NTEFTRVNAK PPPGDQFKHL IPLMYKGLPW NWRIKIVQM 

I I 1 I I I I I I I I I 

6305 6315 6325 6335 6345 6355 

EMCR ISDYLSNLSD ILVFVLWAGS LELTTMRYFV KIGP-IKYCY CGNSATCYNS VSNEYCCFKH 

229E lADFLAGSSD VLVFVLWAGG LELTTMRYFV KIGA-VKHCQ CGTVATCYNS VSNDYCCFKH 

PEDV CSDYLANLSD ILIFVLWAGG LELTTMRYFV KIGP-SKSCD CGKVATCYNS ALHTYCCFKH 

TGEV VCDYFDGLSD ILIFVLWAGG LELTTMRYFV KIGR-PQKCE CGKSATCYSS SQSVYACFKH 

OV43 FADHLIDLSD CVVLVTWAAN FELTCLRYFA KVGREISCNV CTKRATVYNS RTGYYGCWRH 

BoCoV FADHLIDLSD CVVLVTWAAN FELTCLRYFA KVGREISCNV STKRATAYNS RTGYYGCWRH 

MHV LSDHLVDLAD SVVLVTWAAS FELTCLRYFA KVGKEVVCSV CNKRATCFNS RTGYYGCWRH 

AIBV LADNLCNVSD CVVFVTWCHG LELTTLRYFV KIGK-EQVCS CGSRATTFNS HTQAYACWKH 

SARS CoV LSDTLKGLSD RWFVLWAHG FELTSMKYFV KIGPERTCCL CDKRATCFST SSDTYACWNH 

I I I I I I I I 1 I I 1 

6365 6375 6385 6395 6405 6415 

EMCR ALGCDYVYNP YAFDIQQWGY VGSLSQNHHT FCNIHRNEHD ASGDAVMTRC LAVHDCFVKN 

22 9E ALGCDYVYNP YVIDIQQWGY VGSLSTNHHA ICNVHRNEHV ASGDAIMTRC LAVYDCFVKN 

PEDV ALGCDYLYNP YCIDIQQWGY KGSLSLNHHE HCNVHRNEHV ASGDAIMTRC LAIHDCFVKN 

TGEV ALGCDYLYNP YCIDIQQWGY TGSLSMNHHE VCNIHRNEHV ASGDAIMTRC LAIHDCFVKR 

OV43 SVTCDYLYNP LIVDIQQWGY IGSLSSNHDL YCSVHKGAHV ASSDAIMTRC LAVYDCFCNN 

BoCoV SVTCDYLYNP LIVDIQQWGY IGSLSSNHDL YCSVHKGAHV ASSDAIMTRC LAVYDCFCNN 

MHV SYSCDYLYNP LIVDIQQWGY TGSLTSNHDL ICSVHKGAHV ASSDAIMTRC LAVHDCFCKS 

AIBV CLGFDFVYNP LLVDIQQWGY SGNLQFNHDL HCNVHGHAHV ASVDAIMTRC LAINNAFCQD 

SARS CoV SVGFDYVYNP FMIDVQQWGF TGNLQSNHDQ HCQVHGNAHV ASGDAIMTRC LAVHECFVKR 

I I I I I 1 j I I I I I 

6425 6435 6445 6455 6465 6475 

EMCR VDWTVTYPFI ANEKFINGCG RNVQGHVVRA ALRLYKPSVI HDIGNPKGVR CA-VTDAKWY 

229E VDWSITYPMI ANENAINKGG RTVQSHIMRA AIKLYNPKAI HDIGNPKGIR CA-VTDAKWY 

PEDV VDWSITYPFI GNEAVINKSG RIVQSHTMRS VLKLYNPKAI YDIGNPKGIR CA-VTDAKWF 

TGEV VDWSIVYPFI DNEEKINKAG RIVQSHVMKA ALKIFNPAAI HDVGNPKGIR CA-TTPIPWF 

OV43 INWNVEYPII SNELSINTSC RVLQRVILKA AMLCNRYTLC YDIGNPKAIA CV — KDFDFK 

BoCoV INWNVEYPII SNELSINTSC RVLQRVMLKA AMLCNRYTLC YDIGNPKAIA CV — KDFDFK 

MHV VNWSLEYPII SNEVSVNTSC RLLQRVMFRA AMLCNRYDVC YDIGNPKGLA CV— KGYDFK 

AIBV VNWDLTYPHI ANEDEVNSSC RYLQRMYLNA CVDALKVNW YDIGNPKGIK CVRRGDVNFR 

SARS COV VDWSVEYPII GDELRVNSAC RKVQHMWKS ALLADKFPVL HDIGNPKAIK CVPQAEVEWK 

I I I I I I I I I I I I 

6485 6495 6505 6515 6525 6535 

EMCR CYDKQPVNSN VKLLDYD YATHG— QLD GLCLFWNCNV DMYPEFSIVC RFDTRTRSVF 

229E CYDKNPINSN VKTLEYD YMTHG — QMD GLCLFWNCNV DMYPEFSIVC RFDTRTRSTL 

PEDV CFDKNPTNSN VKTLEYD YITHG — QFD GLCLFWNCNV DMYPEFSVVC RFDTRCRSPL 

TGEV CYDRDPINNN VRCLDYD YMVHG — QMN GLMLFWNCNV DMYPEFSIVC RFDTRTRSKL 

OV43 FYDAQPIVKS VKTLLYS FEAHKDSFKD GLCMFWNCNV DKYPPNAVVC RFDTRVLNNL 

BoCoV FYDAQPIVKS VKTLLYF FEAHKDSFKD GLCMFWNCNV DKYPPNAVVC RFDTRVLNNL 

MHV FYDASPVVKS VKQFVYK YEAHKDQFLD GLCMFWNCNV DKYPANAWC RFDTRVLNKL 

AIBV FYDKNPIVRN VKQFEYD YNQHKDKFAD GLCMFWNCNV DCYPDNSLVC RYDTRNLSVF 

SARS CoV FYDAQPCSDK AYKIEELFYS YATHHDKFTD GVCLFWNCKV DRYPANAIVC RFDTRVLSNL 

I I I I t I I I I I I I 

6545 6555 6565 6575 6585 6595 

EMCR NLEGVNGGSL YVNKHAFHTP AYDKRAFVKL KPMPFFYFDD SDCDVVQ -EQVNYVPLR 

22 9E NLEGVNGGSL YVNNHAFHTP AYDKRAMAKL KPAPFFYYDD GSCEVVH -DQVNYVPLR 

PEDV NLEGCNGGSL YVNNHAFHTP AFDKRAFAKL KPMPFFFYDD TECDKLQ -DSINYVPLR 

TGEV SLEGCNGGAL YVNNHAFHTP AYDRRAFAKL KPMPFFYYDO SNCELVD GQPNYVPLK 

OV43 NLPGCNGGSL YVNKHAFHTK PFARAAFEHL KPMPFFYYSD TPCVYMDGMD AKQVDYVPLK 

BoCoV NLPGCNGGSL YVNKHAFHTK PFSRAAFEHL KPMPFFYYSD TPCVYMDGMD AKQVDYVPLK 

MHV NLPGCNGGSL YVNKHAFHTS PFTRAAFENL KPMPFFYYSD TPCVYMEGME SKQVDYVPLR 

AIBV NLPGCNGGSL YVNKKAFYTP KFDRISFRNL KAMPFFFYDS SPCETIQVDG -VAQDLVSLA 

SARS CoV NLPGCDGGSL YVNKHAFHTP AFDKSAFTNL KQLPFFYYSD SPCESHGKQV VSDIDYVPLK 

I I I I I I I I I I I I 

6605 6615 6625 6635 6645 6655 

EMCR ASSCVTRCNI GGAVCSKHAN LYQKYVEAYN TFTQAGFNIW VPHSFDVYNL WQIFIET-NL 

22 9E ATNCITKCNI GGAVCSKHAN LYRAYVESYN IFTQAGFNIW VPTTFDCYNL WQTFTEV-NL 

PEDV ASNCITKCNV GGAVCSKHCA MYHSYVNAYN TFTSAGFTIW VPTSFDTYNL WQTFSN — NL 

TGEV SNVCITKCNI GGAVCKKHAA LYRAYVEDYN IFMQAGFTIW CPQNFDTYML WHGFVNSKAL 

OV43 SATCITRCNL GGAVCLKHAE EYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTK L 

BoCOV SATCITRCNL GGAVCLKHAE EYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTK L 

MHV SATCITRCNL GGAVCLKHAE DYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTR L 

AIBV TKDCITKCNI GGAVCKKHAQ MYAEFVTSYN AAVTAGFTFW VTNKLNPYNL WKSFSA L 

SARS CoV SATCITRCNL GGAVCRHHAN EYRQYLDAYN MMISAGFSLW lYKQFDTYNL WNTFTR L 
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....I. ...I ....1 1 I I I I I I I I 

6665 6675 6685 6695 6705 6715 

EMCR QSLENIAFNV VKKGCFTGVD GELPVAWHD KVFVRYGDVD NLVFTNKTTL PTNVAFELFA 

229E QGLENIAFNV VNKGSEVGAD GELPVAISGD KVFVRDGNTD NLVFVNKTSL PTNIAFELFA 

PEDV QGLENIAFNV LKKGSFVGDE GELPVAWND KVLVRDGTVD TX.VFTNKTSL PTNVAFELYA 

TGEV QSLENVAFNV VKKGAFTGLK GDLPTAVIAD KIMVRDGPTD KCIFTNKTSL PTNVAFELYA 

OV43 QSLENVVYNL VKTGHYTGQA GEMPCAIIND KVVAKIDKED WIFINNTTY PTNVAVELFA 

BOCOV QSLENVVYNL VKTGHYTGQA GEMPCAIIND KVVAKIDKED WIFINNTTY PTNVAVELFA 

MHV QSLENVVYNL VNAGHFDGRA GELPCAVIGE KVIAKIQNED WVFKNNTPF PTNVAVELFA 

AIBV QSIDNIAYNM YKGGHYDAIA GEMPTVITGD KVFVIDQGVE KAVFVNQTTL PTSVAFELYA 

SARS CoV QSLENVAYNV VNKGHFDGHA GEAPVSIINN AVYTKVDGID VEIFENKTTL PVNVAFELWA 



I 1 I I I I I I I I I I 

6725 6735 6745 6755 6765 6775 

EMCR KRKMGLTPPL SILKNLGWA TYKFVLWDYE ABRPFTSYTK SVCKYTDFN- EDV 

229E KRKVGLTPPL SILKNLGWA TYKFVLWDYE AERPLTSFTK SVCGYTDFA- EDV 

PEDV KRKVGLTPPI TILRNLGVVC TSKCVIWDYE AERPLTTFTK DVCKYTDFE- GDV 

TGEV KRKLGLTPPL TILRNLGVVA TYKFVLWDYE AERPFSNFTK QVCSYTDLD- SEV 

OV43 KRSVRHHPEL KLFRNLNIDV CWKHVIWDYA RESIFCSNTY GVCMYTDLK- FIDKL 

BoCoV KRSIRHHPEL KLFRNLNIDV CWKHVIWDYA RESIFCSNTY GVCMYTDLK- LIDKL 

MHV KRSIRPHPEL KLFRNLNIDV CWSHVLWDYA KDSVFCSSTY KVCKYTDLQ CIBSL 

AIBV KRNIRTLPNN RILKGLGVDV TNGFVIWDYA NQTPLYRNTV KVCAYTDIE- PNGL 



SARS CoV KRNIKPVPEI KILNNLGVDI AANTVIWDYK REAPAHVSTI GVCTMTDIAK KPTESACSSL 



I. ...I I I I I 1 I I I I I 

6785 6795 6805 6815 6825 6835 

EMCR CVCFDNSIQG SYERFTLTTN AVLFSTWIK N LTPIK LNFGMLNGMP VSSIKSDKGV 

22 9E CTCYDNSIQG SYERFTLSTN AVLFSATAVK TGGKSLPAIK LNFGMLNGNA lATVKSEDGN 

PEDV CTLFDNSIVG SLERFSMTQN AVLMSLTAVK K LTGIK LTYGYLNGVP VN THED- 

TGEV VTCFDNSIAG SFERFTTTRD AVLISNNAVK G LSAIK LQYGLLNDLP VS TVGN- 

OV43 NVLFDGRDNG ALEAFKRSNN GVYISTTKVK S LS MIRGPPRAEL NGWVDKVGD 

BoCoV NVLFDGRDNG ALEAFKRSNN GVYISTTKVK S LS MIRGPPRAEL NGVWDKVGD 

MHV NVLFDGRDNG ALBAFKKCRD GVYINTTKIK S LS MIKGPQRADL NGVWEKVGD 

AIBV WLYDDR-YG DYQSFLAADN AVLVSTQCYK R YS YVEIPSNLLV QNGMPLKDG- 

SARS CoV TVLFDGRVEG QVDLFRNARN GVLITEGSVK G LT PSKGPAQASV NGVTLIGES- 



1 I I I I I I I I I I I 

6845 6855 6865 6875 6885 6895 

EMCR EKLVNWYTYV RKNGQFQDHY DG FYTQ 

229E IKNINWFVYV RKDGKPVDHY DG FYTQ 

PEDV -KPFTWYIYT RKNGKFEDYP DG- YFTQ 

TGEV -KPVTWYIYV RKNGEYVEQI DS YYTQ 

OV43 -TDCVFYFAV RKEGQDVIFS QFDSLGVSSN QSPQGNLGSN GKPGNVGGND ALSISTIFTQ 

BoCoV -TDCVFYFAV RKEGQDVIFS QFDSLRVSSN QSPQGNLGSN -EPGNVGGND ALATSTIFTQ 

MHV -SDVEFWFAM RRDGDDVIFS RTGSLEPSHY RSPQGNPGGN -RVGDLSGNE ALARGTIFTQ 

AIBV ANLYVYK RVNGAFVTLP N TINTQ 

SARS CoV -VKTQFNYFK KVDG — HQ- QLP ETYFTQ 



I I I I I I I I I I I I 

6905 6915 6925 6935 6945 6955 

EMCR GRNLSDFTPR SDMEYDFLNM DMGVFINKYG LEDFNFEHW YGDVSKTTLG GLHLLISQFR 

229E GRNLQDFLPR STMEEDFLNM DIGVFIQKYG LEDFNFEHW YGDVSKTTLG GLHLLISQVR 

PEDV GRTTADFSPR SDMEKDFLSM DMGLFINKYG LEDYGFEHW YGDVSKTTLG GLHLLISQVR 

TGEV GRTFETFKPR STMEEDFLSM DTTLFIQKYG LEDYGFEHW FGDVSKTTIG GMHLLISQVR 

OV43 SRVISSFTCR TDMEKDFIAL DQDVFIQKYG LEDYAFEHIV YGNFNQKIIG GLHLLIGLYR 

BoCoV SRVISSFTCR TDMEKDFIAL DQDVFIQKYG LEDYAFEHIV YGNFNQKIIG GLHLLIGLYR 

MHV SRFLSSFAPR SEMEKDFMDL DEDVFIAKYS LQDYAFEHVV YGSFNQKIIG GLHLLIGLAR 

AIBV GRSYETFEPR SDIERDFLAM SEESFVERYG -KDLGLQHIL YGEVDKPQLG GLHTVIGMYR 

SARS CoV SRDLEDFKPR SQMETDFLEL AMDEFIQRYK LEGYAFEHIV YGDFSHGQLG GLHLMIGLAK 

I 1 1 I I I I 1 1 1 I I 

6965 6975 6985 6995 7005 7015 

EMCR LSKMGVLKAD DFVTASDTTL RCCTVTYLNE LSSKVVCTYM DLLLDDFVTI LK SLDLG 

229E LSKMGILKAE EFVAASDITL KCCTVTYLND PSSKTVCTYM DLLLDDFVSV LK SLDLT 

PEDV LACMGVLKID EFVSSNDSTL KSCTVTYADN PSSKMVCTYM DLLLDDFVSI LK SLDLS 

TGEV LAKMGLFSVQ EFMNNSDSTL KSCCITYADD PSSKNVCTYM DILLDDFVTI IK SLDLN 

OV43 RQQTSNLWQ EFVS-YDSSI HSYFITDEKS GGSKSVCTVI DILLDDFVAL VK SLNLN 

BoCoV RQQTSNLVIQ EFVS-YDSSI HSYFITDEKS GGSKSVCTVI DILLDDFVAL VK SLNLN 

MHV RQQKSNLVIQ EFVP-YDSSI HSYFITDENS GSSKSVCTVI DLLLDDFVDI VK SLNLN 

AIBV LLRANKLNAK SVTN-SDSDV MQNYFVLSDN GSYKQVCTW DLLLDDFLEL LRNILKEYGT 
SARS Gov RSQDSPLKLE DFIP-MDSTV KNYFITDAQT GSSKCVCSVI DLLLDDFVEI IK SQDLS 



1 1 1 I I I I I I 1 I I 

7025 7035 7045 7055 7065 7075 

EMCR VISKVHEVII DNKPYRWMLW CKDNHLSTFY PQLQS-AEWK CGYAMPQIYK LQRMCLBPCN 

229E VVSKVHEVII DNKPWRWMLW CKDNAVATFY PQLQS-AEWK CGYSMPGIYK TQRMCLEPCN 

PEDV VVSKVHEVMV DCKMWRWMLW CKDHKLQTFY PQLQA-SEHK CGYSMPSIYK IQRMCLEPCN 

TGEV WSKWDVIV DCKAWRWMLW CENSHIKTFY PQLQS-AEWN PGYSMPTLYK IQRMCLERCN 

OV43 CVSKVVNVNV DFKDFQFMLW CNDEKVMTFY PRLQAASDWK PGYSMPVLYK YLNSPMERVS 

BoCoV CVSKVVNVNV DFKDFQFMLW CNDEKVMTFY PRLQAASDWK PGYSMPVLYK YLNSPMERVS 

MHV CVSKVVNVNV DFKDFQFMLW CNEEKVMTFY PRLQAAADWK PGYVMPVLYK YLESPLERVN 

AIBV NKSKVVTVSI DYHSINFMTW FEDGSIKTCY PQLQS--AWT CGYNMPELYK VQNCVMEPCN 

SARS CoV VISKWKVTI DYAEISFMLW CKDGHVETFY PKLQASQAWQ PGVAMPNLYK MQRMLLEKCD 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 

76/87 



....I I I I ....( I I I I I I. ...I 

7085 7095 7105 7115 7125 7135 

EMCR LYNYGAGIKL PSGIMLNWK YTQLCQYLNS TTMCVPHNMR VLHYGAGSDK GVAPGTTVLK 

229E LYNYGAGLKL PSGIMFNWK YTQLCQYFNS TTLCVPHNMR VLHLGAGSDY GVAPGTAVLK 

PEDV LYNYGAGVKL PDGIMFNVVK YTQLCQYLNS TTMCVPHHMR VLHLGAGSDK GVAPGTAVLR 

TGEV LYNYGAQVKL PDGITTNWK YTQLCQYLNT TTLCVPHKMR VLHLGAAGAS GVAPGSTVLR 

OV43 LWNYGKPVTL PTGCMMNVAK YTQLCQYLNT TTLAVPVNMR VLHLGAGSEK GVAPGSAVLR 

BoCoV LWNYGKPVTL PTGCMMNVAK YTQLCQYLNT TTLAVPVNTR VLHLGAGSEK GVAPGSAVLR 

MHV LWNYGKPITL PTGCLMNVAK YTQLCQYLNT TTLAVPANMR VLHLGAGSDK DVAPGSAVLR 

AIBV IPNYGVGITL PSGILMNVAK YTQLCQYLSK TTICVPHNMR VMHFGAGSDK GVAPGSTVLK 

SARS CoV LQNYGENAVI PKGIMMNVAK YTQLCQYLNT LTLAVPYNMR VIHFGAGSDK GVAPGTAVLR 



I I I I I I I I I I I I 





7145 


7155 


7165 


7175 


7185 


7195 


EMCR 


RWLPPD 


All I 


DNDINDYVSD 


ADFSITGDCA 


TVYLEDKFDL 


LISDMYDG— 


229E 


RWLPHD 


AIVV 


DNDWDYVSD 


ADFSVTGDCA 


TVYLEDKFDL 


LISDMYDG— 


PEDV 




AIIV 


DNDSVDYVSD 


ADYSVTGDCS 


TLYLSDKFDL 


VISDMYDG — 


TGEV 


RWLPDD 


AILV 


DNDLRDYVSD 


ADFSVTGDCT 


SLYIEDKFDL 


LVSDLYDG — 


OV43 


QWLPAG 


TILV 


DNDLYPFVSD 


SVATYFGDCI 


TLPFDCQWDL 


IISDMYDP — 


BoCoV 


QWLPAGTILR QWLPAGTILV 


HNDLYPFVSD 


SVATYFGDCI 


TLPFDCQWDL 


IISDMYD 


MHV 


QWLPAG 


SXLV 


DNDINPFVSD 


SVASYYGNCI 


TLPIACQWDL 


IISDMYDP— 


AIBV 


QWLPEG 


TLLV 


DNDIVDYVSD 


AHVSVLSDCN 


KYNTEHKFDL 


VISDMYTDND 


SARS CoV 


QWLPTG 


-TLLV 


DSDLNDFVSD 


ADSTLIGDCA 


TVHTANKWDL 


IISDMYDP— 



I I I I I I I ! I I I I 

7205 7215 7225 7235 7245 7255 

EMCR — RIKFCDGE NVSKDGFFTY LNGVIREKLA IGGSVAIKIT EYSWNKYLYE LIQRFAFWTL 

229E — RTKAIDGE NVSKEGFFTY INGFICEKLA IGGSIAIKVT EYSWNKKLYE LVQRFSFWTM 

PEDV — KIKSCDGE NVSKEGFFPY INGVITEKLA LGGTVAIKVT EFSWNKKLYE LIQKFEYWTM 

TGEV — STKSIDGE NTSKDGFFTY INGFIKEKLS LGGSVAIKIT EFSWNKDLYE LIQRFEYWTV 

OV43 — ITKNIGEY NVSKDGFFTY ICHMIRDKLA LGGSVAIKIT EFSWNAELYK LMGYFAFWTV 

BoCoV — LLLDIGVH VVRCS YI HCHMIRDKLA LGGSVAIKIT EFSWNAELYK LMGYFAFWTV 

MHV — LTKNIGEY NVSKDGFFTY LCHLIRDKLA LGGSVAIKIT EFSWNAELYS LMGKFAFWTI 

AIBV SKRKHEGVIA NNGNDDVFIY LSSFLRNNLA LGGSFAVKVT ETSWHEVLYD lAQDCAWWTM 

SARS GOV — RTKHVTKE NDSKEGFFTY LCGFIKQKLA LGGSIAVKIT EHSWNADLYK LMGHFSWWTA 



I I I j I I I \ I I I I 

7265 7275 7285 7295 7305 7315 

EMCR FCTSVNTSSS EAFLIGINYL GDFIQGPFIA GNTVHANYIF WRNSTIMSLS YNSVLDLSKF 

229E FCTSVNTSSS EAFVVGINYL GDFAQGPFID GNIIHANYVF WRNSTVMSLS YNSVLDLSKF 

PEDV FCTSVNTSSS EAFLIGVHYL GDFASGAVID GNTMHANYIF WRNSTIMTMS YNSVLDLSKF 

TGEV FCTSVNTSSS EGFLIGINYL GPYCDKAIVD GNIMHANYIF WRNSTIMALS HNSVLDTPKF 

OV43 FCTNANASSS EGFLIGINYL CKPKV — EID GNVMHANYLF WRNSTVWNGG AYSLFDMAKF 

BoCoV FCTNANASSS EGFLIGINYL GKPKV — EID GNVMHAIICF G 

MHV FCTNVNASSS EGFLIGINWL NRTRT — EID GKTMHANYLF WRNSTMWNGG AYSLFDMSKF 

AIBV FCTAVNASSS EAFLIGVNYL GASEK-VKVS GKTLHANYIF WRNCNYLQTS AYSIFDVAKF 

SARS CoV FVTNVNASSS EAFLIGANYL GKPKE — QID GYTMHANYIF WRNTNPIQLS SYSLFDMSKF 



I I I I 

7325 7335 
EMCR ECKHKATVW TLKDSDVNDM 

229E NCKHKATVW QLKDSDINEM 

PEDV NCKHKATVW NLKDSSISDV 

TGEV KCRCNN7VLIV NLKEKELNEM 

OV43 PLKLAGTAVI NLRADQINDM 

BoCoV EIPQFGTGVL lACLIWLNSR 

MHV PLKVAGTAVV SLKPDQINDL 

AIBV DLRLKATPVV NLKTEQKTDL 

SARS CoV PLKLRGTAVM SLKENQINDM 



1 I I I I I 

7345 7355 7365 

VLSUKSGRL LLRNSGRFGG FSNHLVSTK- 

VLSLVRSGKL LVRGNGKCLS FSNHLVSTK- 

VLGLLKNGKL LVRNNDAICG FSNHLVNVNK 

VIGLLRKGKL LIRNNGKLLN FGNHFVNTP- 

VYSLLEKGKL LIRDTNKEVF VGDSLVNVI- 

LSWLVMP 

VLSLIEKGKL LVRDTRKEVF VGDSLVNVK- 

VFNLIKCGKL LVRDVGNTSF TSDSFVCTM- 

lYSLLEKGRL IIRENNRVW SSDILVNN— 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



77/87 



PCT/NL2004/000805 



e. Putative Spike protein 



EMCR S 

229E S 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat Gov 
PHEV 
AIBV 
SARS 



EMCR S 

229E S 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat Gov 
PHEV 
AIBV 
SARS 



EMCR S 

229E S 

PEDV 

TGEV 

GaCoV 

FeCoV 

Por Resp C 
OC43 
BoGoV 
MHV 

Rat CoV 
PHEV 
AIBV 
SARS 



MKLFLI 



15 

LLILP 



. I 1 . 

25 

-L VSCFSTC- 



.1.. 

35 



I I.. 

45 

N SNASIS- 



. 1 . . 

55 



-ML 



MRSLIYFWLL LPVLP T LSLPQDV T 

MKKLFW LWMP L lYGDNFP C 

MIVLTLC LFLFL-YSSV SCTSNND C 

MIVLVTC LLLLCSYHTV LSTTNNE C 

MKKLFW LVVMP L lYG 

MFLIL LISLPTAFAV IGDLKCTSDT SYINDKDTGP 

MFLIL LISLPMAFAV IGDLKCT— T VSINDVDTGA 

MLFVF LTLLPSSLGY IGDFRCIQ-L VNTDTSNASA 

MLFVF LTLLPSCLGY IGDFRCIN-L VNTRISNARA 

MFFIL LISLPSAFAV IGDLKCT— T SLINDVDTGV 



RCQSTTNF — RRFFS 

SKLTNRTIGN QWNLIETFLL 
VQVNVTQLPG NENIIKDFLF 
IQVNVTQLAG NENLIRDFLF 



PPISTDTVDV 
PSISTDIVDV 
PSVSTEVVDV 
PSVSTEVVDV 
PSISSEVVDV 



TNGLGTYYVL 
TNGLGTYYVL 
SKGIGTYYVL 
SKGLGTYYVL 
TNGLGTFYVL 



-MFIFLL FLTLTSG SDLDR- 



-CTTFDDVQA PNYTQHTSSM 



I.... I 1 I 

65 75 
QLG— VPDNS STIVTGLLP- 



..| ( I.... I I I 1 I 

85 95 105 115 

-VHWICAN QSTSSYPANG FFYIDVG-KH RSAFALHSGY 



KFN — VQAPA WVLGGYLPS MNSSSWYCGT GIETASGVHG 
NYSSRLPPNS DVVLGDYFPT — VQPWFNCI RNDSNDLYVT 

QN FKEEG SLWGGYYP- — TEVWYNCS TTQQTTAYKY 

SN FKEEG SVWGGYYP- — TEVWYNCS RTARTTAFQY 



DR VYLNT TLFLNGYYPT SGSTYRNMAL KGSVLLSRLW 

DR VYLNT TLLLNGYYPT SGSTYRNMAL KGTLLLSRLW 

DR VYLNA TLLLTGYYPV DGSMYRNMAL TGINTISLNW 

DR VYLNA TLLLTGYYPV DGSMYRNMAL MGTNTLSLNW 

DR VYLNT TLLLNGYYPI SGATFRNMAL KGTRLLSTLW 



IFLSYIDSGQ 
LENLKALYWD 
FSNIHAFYFD 
FNNIHAFYFV 

D- 

FKPPFLSDFI 
FKPPFLSDFI 
YKPPFLSEFN 
FEPPFLSEFN 
FKPPFLSPFN 



GFEIGISQEP 
YATENITWN- 
MEAMENSTGN 
MEAMEN5TGN 



NGIFAKVKNT 
NGIFAKVKNT 
DGIFAKVKNL 
DGIYAKVKNL 
EX3IFAKVKNS 



RG VYYPD EIFRSDTLYL TQDLFLPFYS NVTGFHTINH TFGNPVIPFK DGIYFAATEK 



I 



125 135 
YDANQYYIYL TNKIH— 



145 



I 



I 



I . . 

155 165 
LNAPVTLKIC KFGN 



..I I 

175 
-TSFDFLS 



FDPSGYQLYL HKATNG N TNAIARLRIC QFPDN — KTLGPTVN 

-HRQRLNVW NGYPYSITV- TTTRN FNSAEGAIIC ICKGSPPTTT TESSLTCNWG 

ARGKPLLVHV HGNPVSIIVY ISAYRDDVQF RPLLKHGLLC ITKN— DTVD YNSFTINQWR 

ARGKPLLFHV HGEPVSVII SAYRDDVQQ RPLLKHGLVC ITKN— RHIN YEQFTSNQWN 



KVIKDRVMYS EFPAITIG- 
KVIKKGVMYS EFPAITIG- 
KASLPKDSIS YFPTIIIG- 
KASLPIGSAS YFPTIIIG- 
RFSKDGVIYS EFPAITIG- 



-STF VNTSYSWVQ PRTINSTQDG YNKLQGLLEV 

-STF VNTSYSWVQ PHTTN L DNKLQGLLEI 

-SNF VTTSYTWLE PYN GIIMA 

-SNF VNTSYTVVLE PYN GIIMA 

-STF VNTSYSIVVE PHTSL I NGNLQGLLQI 



SNWRGWVFG STMNNKSQS- 



-VII INNSTNWIR ACN FEL CDNPFFAVSK 



I I I I I I I I I I I I 

185 195 205 215 225 235 

EMCR S NVSTSHDCIV NLSFTEQL — GVPLGITISG ETVRLHLYNA TRTFYVPAAY KLTKLSVKCY 

229E S —MFVLLVAY ALLHIAG 

PEDV DVTTGRNCLF NKAIPAYMRD GKDIVVGITW DNDRVTVF-A DKIYHFYLKN DWSRVATRCY 

TGEV SECR-LNHKF PICPSNSEAN CGNMLYGLQW FADEVVAYLH GASYRISFEN QWSGTVTFGD 

CaGoV DICLGDDRKI PFSWPTDN- -GTKLFGLEW NDDYVTAYIS DESHRLNINN NWFNNVTLLY 

FeCoV STCTGADRKI PFSVIPTDN GTKIYGLEW NDDFVTAYIS GRSYHLNINT NWFNNVTLLY 

Por Resp C KF P 

OG43 SVGQYNMGEY PQTICHPNLG — NHRKELWH LDTGWSGLY KRNFTYDVNA DYLYFHFYQ- 

BoCoV SVCQYTMCEY PHTICHPKLG — NKRVELWH WDTGVVSCLY KRNFTYDVNA DYLYFHFYQ- 

MHV SICQYTICQL PYTDCKPNTG G-NKLIGFWH TELKSPVCIL KRNFTFNVNA EWLYFHFYQ- 

Rat Gov SICQYTICQL PHTDGKPNTG G-NTLIGFWH TDLRPPVCIL KRNFTFNVNA EWLYFHFYQ- 

PHEV SVCQYTMGEY PHTIGHPNLG — NQRIELWH YDTDVVSCLY RRNFTYDVNA DYLYFHFYQ- 

AIBV MLVTPLL LVTLLGALCS AVLYDSS 

SARS PMGTQTHTMI FDNAFNCTFE YISDAFSLDV SE-KSGNFKH LREFVFKNKD GFLYVYKG— 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



PCT/NL2004/000805 



78/87 



I. ...I I I I I I 1 I I I I 

245 255 265 275 285 295 

EMCR S FSESCVF SWNA T-ITVNVTT LNGRIVNYTV CDD CNGY TDNIFSVQQD 

22 9E S CQT TNGLNTSYSV CNG CVGY SENVFAVESG 

PEDV NRRSCAM QYVYTP TY-YMLNVTS AGEDGIYYEP CTAN— CTGY AANVFATDSN 

TGEV MRATTLEVAG TLVDLWWFNP VYDVSYYRVN NKNGTTVVSN CTD— QCASY VANVFTTQPG 

CaCoV SRTSTATWQH S — AAYVYQG VSNFTYYKLN KTAGLKSYEL CEDYEYCTGY ATNVFAPTSG 

FeCoV SRSSTATWEY S — AAYAYQG VSNFTYYKLN NTNGLKTYEL CEDYEHCTGY ATNVFAPTSG 

Por Resp C TSVVSN CTD— QCASY VANVFTILPG 

OC4 3 EGG TFYAYFTDT GWTKFLFNV YLG MALS HYYVMPLTCN 

BoCoV EGG -TFYAYFTDT GWTKFLFNV YLG TVLS HYYVLPLTCS 

MHV QGG TFYAYYADV SSATTFLFSM YIG DVLT QYFVLPYMCT 

Rat CoV QGG TFYAYYADV SSATTFLFSS YIG AVLT QYFVLPYMCS 

PHEV EGG TFYAYFTDT GFVTKFLFKL YLG TVLS HYYVMPLTCN 

AIBV S YVYYYQS AFRPPSGWHL QGG AYAV VNISSEFNNA 

SARS YQP IDWRDLPSG FNTLKPIFKL PLG-INITNF RAILTAFSPA 

I I I I I 1 I I 1 I I I 

305 315 325 335 345 355 

EMCR S GRIPNGFPFN NWFL-LTNGS TLVDGVSRLY QPLRLTCLWP VPGLKSSTGF VYFNATGSDV 

22 9E S GYIPSDFAFN NWFL-LTNTS SWDGVVRSF QPLLLNCLWS VSGLRFTTGF VYFNGTG-RG 

PEDV GHIPEGFSFN NWFL-LSNDS TLLHGKWSN QPLLVNCLLA IPKIYGLGQF FSFNHTM-DG 

TGEV GFIPSDFSFN NWFL-LTNSS TLVSGKLVTK QPLLVNCLWP VPSFEEAAST FCFEGAG-FD 

CaCoV GYIPDGFSFN NWFM-LTNSS TFVSGRFVTN QPLLVNCLWP VPSFGVAAQE FCFEGAQ-FS 

FeCoV GYIPDGFSFN NWFL-LTNSS TFVSGRFVTN QPLLINCLWP VPSFGVAAQE FCFEGAQ-FS 

Por Resp C GFIPSDFSFN NWFL-LTNSS TLVNGKLVTK QPLLVNCLWP VPSFEEVAST FCFEGAD-FD 

OC43 SKVKNGFTLE YWVTPLTSRQ YLLAFNQDGI IFNAVDCMSD FMSEIKCKTQ SIAPPTG-VY 

BoCoV S AMTLE YWVTPLTSKQ YLLAFNQDGV IFNAVDCKSD FMSEIKCKTL SIAPSTG-VY 

MHV LTTTGVFSPQ YWVTPLVKRQ YLFNFNQKGI ITSAVDCASS YTSEIKCKTQ SMNPNTG-VY 

Rat CoV PTTSGVSSPQ YWVTPLVKRQ YLFNFNQKGI ITSAVDCASS YTSEIKCKTQ SMNPNTG-VY 

PHEV S ALSLE YWVTPLTTRQ FLLAFDQDGV LYHAVDCASD FMSEIMCKTS SITPPTG-VY 

AIBV G SSS GCTVGIIHGG RWNASSIAM TAPSSGMAWS SSQFCTAHCN FSDTTVFVTH 

SARS QDIWGTSAAA YFVGYLKPTT FMLKYDENGT ITDAVDCSQN PLAELKCSVK SFEIDKG-IY 

....I I I I I I I I I I I I 

365 375 385 395 405 415 

EMCR S NCNGYQHNSV ADVMRYNLNL SANSVDNLKS GVIVFKTLQY -DVLFYCSN- — SS-SGVLD 

229E S DCKGFSSDVL SDVIRYNLNF EEN LRR GTILFKTSYG -VWFYCTN- — NT-LVSGD 

PEDV VCNGAAVDRA PEALRFNIND TSV ILAE GSIVLHTALG TNLSFVCSN SSDPHLAI 

TGEV QCNGAVLNNT VDVIRFNLNF TTNVQSGKGA TVFSLNTTGG VTLEISCY — TVSDSSFFSY 

CaCoV QCNGVSLNNT VDVIRFNLNF TTDVQSGMGA TVFSLNTTGG VILEISCYND TVSESSFYSY 

FeCoV QCNGVSLNNT VDVIRFNLNF TADVQSGMGA TVFSLNTTGG VILEISCYSD TVSESSSYSY 

Por Resp C QCNGAVLNNT VDVIRFNLNF TTNVQSGKGA TVFSLNTTGG VTLEISCYND TVSDSSFSSY 

OC43 ELNGYTVQPI ADVYRRKLNL PNCNIBAWLN DKSVPSPLNW ERKTFSNCNF NMSSLMSFIQ 

BoCoV ELNGYTVQPI ADVYRRIPNL PDCNIEAWLN DKSVPSPLNW ERKTFSNCNF NMSSLMSFIQ 

MHV DLSGYTVQPV GLVYRRVRNL PDCKIEEWLT AKSVPSPLNW ERKTFQNCNF DLSSLLRFVQ 

Rat CoV DLSGYTVQPV GLVYRRVRNL PDCKIEEWLA ANTVPSPLNW ERKTFQNCNF NLSSLLRFVQ 

PHEV ELNGYTVQPV ATVYRRIPDL PNCDIEAWLN SKTVSSPLNW ERKIFSNCNF NMGRLMSFIQ 

AIBV CYKHGGCPLT GMLQQNLIRV SAMKNGQLFY NLTVSVAKYP TFRSFQCVN- — NLTSVYLN 

SARS QTSNFRVVPS GDWRFPNIT NLCPFGEVFN ATKFPSVYAW ERKKISNCVA DYSVLYNSTF 



I 



EMCR S 

229E S 

PEDV 

TGEV 

CaCov 

FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat CoV 
PHEV 
AIBV 
SARS 



425 
TTIPFGPSSQ 
AHIPFGTVLG 
FAIPLGATEV 
GEIPFGVTDG 
GEIPFGVTDG 
GEIPFGITDG 
GEIPFGVTNG 
ADSFTCNNID 
ADSFTCNNID 
AESLSCSNID 
AESLSCSNID 
ADSFGCNNID 
GDLVYTSNET 
FSTFKCYGVS 



I I 

435 
PYYCFINSTI 
NFYCFVNTTI 
PYYCFLKVDT 

PRYCYV H 

PRYCYV L 

PRYCYV L 

PRYCYV L 

AAKIYG — MC 
AAKIYG— MC 
ASKVY6 — MC 
ASKVYG--MC 
ASRLYG — MC 
IDVTSAG— V 
ATKLND— LC 



I 



445 
NTTHVSTFVG 
GNETTSAFVG 
YNSTVYKFLA 
YNGTALKYLG 
YNGTALKYLG 
YNGTALKYLG 
YNGTALKYLG 
FSSITIDKFA 
FSSITIDKFA 
FGSISIDKFA 
FGSISIDKFA 
FGSITIDKFA 
YFKAGGPITY 
FSNVYADSFV 



I i 

455 
ILPPTVREIV 
ALPKTVREFV 
VLPPTVREIV 
TLPPSVKEIA 
TLPPSVKEIA 
TLPPSVKEIA 
TLPPSVKEIA 
IPNGRKVDLQ 
IPNGRKVDLQ 
IPNRRRVDLQ 
IPNSRRVDLQ 
IPNSRKVDLQ 
KVMREVKALA 
VKGDDVRQIA 



I I 

465 
VARTGQFYIN 
ISRTGHFYIN 
ITKYGDVYVN 
ISKWGHFYIN 
ISKWGHFYIN 
ISKWGHFYIN 
ISKWGHFYIN 
LGNLGYLQSF 
LGNLGYLQSF 
LGNSGFLQSF 
LGKSGLLQSF 
VGKSGYLQSF 
YFVNGTAQDV 
PGQTGVIADY 



I I 

475 
GFKYFDLGFl 
GYRYFTLGNV 
GFGYLHLGLL 
GYNFFSTFPI 
GYNFFSTFPI 
GYNFFSTFPI 
GYNFFSTFPI 
NYRIDTTATS 
NYRIDTTATS 
NYKIDTRATS 
NYKIDTRATS 
NYKIDTAVSS 
ILCDGSPRGL 
NYKLPODFMG 



EMCR S 
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....I I I I 

485 495 

EAVNFNVT TASATDFW 

EAVNFNVT TAETTDFC 

DAVTINFTGH GTDDDVSGFW 
DCISFNLT — — TGDSDVFW 
DCIAFNLT — — TGASGAFW 
GCISFNLT— — TGVSGAFW 

DCISFNLT TGDSDVFW 

CQLYYNLP AANVSVS 

CQLYYNLP AANVSVS 

CQLYYSLA KNNVTVN 

CQLYYSLA QDNVTVI 

CQLYYSLP AANVSVT 

LACQYNTG NFSDGFY 

CVLAWNTR NID 



I I 

505 
TVAFATFVDV 
TVALASYADV 
TIASTNFVDA 
TIAYTSYTEA 
TIAYTSYTEA 
TIAYTSYTEA 
TIAYTSYTEA 
RFNPSTWNKR 
RFNPSTWNRR 
NHNPSSWNRR 
NHNPSSWNRR 
HYNPSSWNRR 
PFTNSSLVKQ 
ATSTGNYNYK 



I I I I I I 

515 525 535 

LVNVSATNIQ NLLYCDSPFE KLQCEHLQFG 

LVNVSQTSIA NIIYCNSVIN RLRCDQLSFD 

LIEVQGTSIQ RILYCDDPVS QLKCSQVAFD 

LVQVENTAIT KVTYCNSHVN NIKCSQITAN 

LVQVENTAIK KVTYCNSHIN NIKCSQLTAN 

LVQVENTAIK NVTYCNSHIN NIKCSQLTAN 

LVQVENTAIT NVTYCNSYVN NIKCSQLTAN 

FGFIEDSVFK PRPAGVLTNH DVVYAQHCFK 

FGFTEQFVFK PQPVGVFTHH DVVYAQHCFK 

YGFND VATFGTGKH DVAYAEACFT 

YGFND VATFHSGEH DVAYAEACFT 

YGFNN QSFGSRGLH DAVYSQQCFN 

KFIVYR ENSVNT TCTLHNFIFH 

YRYLR HG 
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EMCR S 

229E S 
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CaCoV 
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SARS 



EMCR S 

229E S 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat CoV 
PHEV 
AIBV 
SARS 



EMCR S 
229E S 
PEDV 
TGEV 

CaCov 
FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat Cov 

PHEV 
AIBV 
SARS 



....I..., I 

545 
LQDGFY— SA 
VPDGFY— ST 
LDDGFYPISS 
LNNGFYPVSS 
LQNGFYPVAS 
LNNGFYPVAS 
LNNGFYPVSS 
APKNFCPCKL 
APKNFCPCKL 
VGASYCPCAN 
VGASYCPCAK 
TPNTYCPCRT 
NETGANPNPS 
KLRPFER 



.1 



555 
— NFLDDNVL 
— SPIQSVEL 
— RNLLSHEQ 
— SEVG — LV 
— SEVG — LV 
— SEVG — FV 
— SEVG — SV 
NGS-CVGSGP 
DGSLCVGNGP 
P-SIVSPCTT 
P-STVYSCVT 
— SQCIG 



I. ...I 

565 

P — -ET 

P VS 

P IS 

N KS 

N KS 

N KS 

N KS 

G KNNG 

GIDAGYKNSG 

G K-PN 

G K-PK 

G AG 

G 

D 



i I 

605 
CKPRQVNISL 
CYPAGVNITL 
ANLVASDTTI 
lASTLSNITL 
lASPLSNITL 
lASTLSNITL 
lASTLSNITL 
TPDPITFKAT 
TPDPITSKST 
NPSPLTTYDL 
NPSPLTTYDP 
QPDPSTYKGV 
LSSFVYKESN 



I 



615 

N GNTSV 

ANFNETKGPL 

N GFSSF 

PMQDHNTDVY 
PMQDNNIDVY 
PMQDNNTDVY 
PMQDNNNDVY 
GTYKCPQTKS 
GPYKCPQTKY 
— R-CLQARS 
— R-CLQARS 
NAWTCPQSKV 
FMYGSYHPSC 
P— AL 



I I 

625 
CVRTSHFSIR 
CVDTSHFTTK 
CVDTRQFTIT 
CIRSDQFSVY 
CIRSNQFSVY 
CIRSNQFSVY 
CVRSDQFSVY 
LVGIGEHCSG 
LVGIGEHCSG 
MLGVGDHCEG 
MLGVGDHCEG 
SIQPGQHCPG 
KFRLETINNG 
NCYWPLNDY- 



1 .... I 

575 
YVALPXYYQH 
IVSLPVYHKH 
FVTLPSFNDH 
WLLPSFYTH 
WLLPSFYSH 
VVLLPSFFTY 
VVLLPSFLTH 
IGTCPAGTNY 
IGTCPAGTNY 
FANCPTGTSN 
SANCPTGTSN 
TGTCPVGTTV 
VQNIQTYQTK 
ISNVPFSPDG 

I I 



585 
TDINFTATA- 
TFIVLYVDFK 
SFVNITVSA- 
TIVNITIGLG 
TSVNITIDLG 
TAVNITIDLG 
TIVNITIGLG 

LTCDN 

LTCHNAA 

RECTVMPLAN 
RECNVQASG- 
RKCFAAVTK- 
TAQSGYYNFN 
KPCTP 



I I 

595 
— SFGGSCYV 
PQSGGGKCFN 
— AFGG-LSS 
-MKRSGYGQP 
-MKRS-VTVT 
-MKLSGYGQP 
-MKRSGYGQP 

LC 

QCDCLC 

-NQFKCDCTC 
-FKSKCDCTC 
— ATKCTCWC 
FSF 



635 
YIYNRVKSGS 
YVAVYANVG- 
LFYNVTNSYG 
VHSTCKSALW 
VHSTCKSSLW 
VHSTCKSSLW 
VHSTCKSVLW 
LAVKSDYCGG 
LAIKSDYCGG 
LGVLEDKCGG 
LGILEDKCGG 
LGLVEDDCSG 
LWFNSLSVS- 
G 



I 1 1 1 

645 655 

p G DSSWHIYLKS 

RWSASINT 

YVSKSQD 

DNIFKRNCTD VLDATAVIKT 
DNNFNSACTD VLDATAVIKT 
DNIFNQDCTD VLEATAVIKT 
DNVFKRNCTD VLDATAVIKT 

N SCTCRPQAFL 

N PCTCQPQAFL 

S N TCNCSAHAFV 

S N ICNCSADAFV 

N PCTCKPQAFI 

lAYGPLQ 

— FYTTTGI 



665 
GTCPFSFSKL 
GNCPFSFGKV 
SNCPFTLQSV 
GTCPFSFDKL 
GTCPFSFDKL 
GTCPFSFDKL 
GTCPFSFDKL 
GWSADSCLQG 
GWSVDSCLQG 
GWAKDSCLAN 
GWAMDSCLSN 
GWSSETCLQN 
GGCKQSVFKG 
GYQPYRVWL 



I I 

675 
NNFQKFKTIC 
NNFVKFGSVC 
NDYLSFSKFC 
NNYLTFNKFC 
NNYLTFNKFC 
NNYLTFNKFC 
NNYLTFNKFC 
DKCNIFANFI 
DRCNIFANFI 
GRCHIFSNLM 
ARCHIFSNLM 
GRCNIFANFI 
RATCCYAYSY 
S FELLN 



I I 

685 
FSTVEVPGSC 
FSLKDIPGGC 
VSTSLLAGAC 
LSLSPVGANC 
LSLNPVGANC 
LSLSPVGANC 
LSLSPVGANC 
LHDVNSGLTC 
FHDVNSGTTC 
LNGINSGTTC 
LNGINSGTTC 
LNDVNSGTTC 
GGPSLCKGVY 
APATVCGPKL 



I I 

695 
NFPLEATW— 
AMPIVANW-- 
TIDLFGYP— 

KFDVAAR 

KLDVAAR 

KFDVAAR 

KFDVAAR 

STDLQKANTD 
STDLQKSNTD 
SMDLQLPNTE 
STDFQLPNTE 
STDLQQGNTI 

SGELDHN 

STDLIKN 



I 



705 
HYTSYTIVGA 
AYSKYYTIGS 
AFGSGVKLTS 
TRTNEQVVRS 
TRTNEQVFGS 
TRTNEQVVRS 
TRTNDQVVRS 
IILGVCVNYD 
IILGVCVNYD 
WTGVCVKYD 
WTGVCVKYD 
ITTDVCVNYD 

FECGLLV 

QCVNFN 



715 
LYVTWSEGNS 
LYVSWSDGDG 
LYFQFTKGEL 
LYVIYEEGDN 
LYVIYEEGDN 
LYVIYEEGDN 
LYVIYEEGDS 
LYGILGQGIF 
LYGITGQGIF 
LYGITGQGIF 
LYGSTGQGVF 
LYGITGQGIL 
YVTKSGGSRI 
FNGLTGTG-V 



I I I I I I 1 I I I 1 I 

725 735 745 755 765 775 

EMCR S ITGVPYPVSG IREFSNLVLN NCTKYNIYDY VGTGIIRSSN QSLAGGITYV S 

229E S ITGVPQPVEG VSSFMNVTLD KCTKYNIYDV SGVGVIRVSN DTFLNGITYT S 

PEDV ITGTPKPLEG ITDVSFMTLD VCTKYTIYGF KGEGIITLTN SSILAGVYYT S 

TGEV IVGVPSDNSG VHDLSVLHLD SCTDYNIYGR TGVGIIRQTN RTLLSGLYYT S 

CaCoV IVGVPSDNSG LHDLSVLHLD SCTDYNIYGR TGVGIIRKTN STLLSGLYYT S 

FeCoV IVGVPSDNSG LHDLSVLHLD SCTDYNIYGR TGVGIIRRTN STLLSGLYYT S 

Por Resp C IVGVPSDNSG LHDLSVLHLD SCTDYNIYGR TGVGIIRQTN RTILSGLYYT S 

OC43 VEVNATYYNS WQNLLYDSNG NLYGFRDYIT NRTFMIRSCY SGRVSAAFHA N 

BOCoV VEVNATYYNS WQNLLYDSNG NLYGFRDYLT NRTFMIRSCY SGRVSAAFHA N 

MHV KEVKADYYHS WQNLLYDVNG NLIGFRDFVA NKSYTIRSCY SGRVSAAYHQ D 

Rat CoV KEVKADYYNS WQNLLYDVNG NLNGFRDIVT NKTYLLRSCY SGRVSAAYHQ D 

PHEV lEVNATYYNS WQNLLYDSSG NLYGFRDYLS NRTFLIRSCY SGRVSAVFHA N 

AIBV QTATEPPVIT QNNYNNITLN TCVDYNIYGR TGQGFITNVT DSAVSYNYLA DAGLAILDTS 

SARS LTPSSKRFQP FQQFGRDVSD FTDSVRDPKT SEILDISPCS FGGVSVITPG TN A 

I f I I I I I I I I I I 

785 795 805 815 825 835 

EMCR S NSGNLLGFKN VSTGNIFIVT PCNQPDQVAV YQQ-SIIGAM TAVNBSRYGL QNLLQLPNFY 

229E S TSGNLLGFKD VTKGTIYSIT PCNPPDQLW YQQ-AWGAM LSENFTSYGF SNWELPKFF 

PEDV DSGQLLAFKN VTSGAVYSVT PCSFSEQAAY VND-DIVGVI SSLSNS — TF NNTRELPGFF 

TGEV LSGDLLGFKN VSDGVIYSVT PCDVSAQAAV IDG-TIVGAI TSINSELLGL THWTTTPNFY 

CaCoV LSGDLLGFKN VSDGWYSVT PCDVSAQAAV IDG-AIVGAM TSINSELLGL THWTTTPNFY 

FeCoV LSGDLLGFKN VSDGVIYSVT PCDVSAQAAV IDG-AIVGAM TSINSELLGL THWTTTPNFY 

Por Resp C LSGDLLGFTN VSDGVIYSVT PCDVSAQAAI IDG-TIVGAI TSINSELLGL THWTTTPNFY 

OC43 SSEPALLFRN IKCNYVFNNS LTRQLQPINY FDS-YLGCW NAYNSTAISV QTCDLTVGSG 

BoCoV SSEPALLFRN IKCNYVFNNT LSRQLQPINY FDS-YLGCW NADNSTSSVV QTCDLTVGSG 

MHV APEPALLYRN LKCDYVFNNN ISREETPLNY FDS-YLGCW NADNSTEEAV DACDLRMGSG 

Rat COV APEPALLYRN LKCDYVFNNN ISREETPLNY FDS-YLGCVI NADNSTEQSV DACDLRMGSG 

PHEV SSEPALMFRN LKCSHVFNNT ILRQIQLVNY FDS-YLGCW NAYNNTASAV STCDLTVGSG 

AIBV GSIDIFVVQG EYGLNYYKVN PCEDVNQQFV VSGGKLVGIL TSRNETGSQL LENQFYIKIT 

SARS SSEVAVLYQD VNCTDVSTAI HADQLTPAWR lYS-TGNNVF QTQAGCLIGA EHVDTSYECD 
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EMCR S 
229E S 
PEDV 

TGEV 
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FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat CoV 
PHEV 
AIBV 
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EMCR S 

229E S 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat COV 
PHEV 
AIBV 
SARS 



EMCR S 

229E S 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat CoV 
PHEV 
AIBV 
SARS 



EMCR S 

229E S 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 

cx:43 

BoCoV 
MHV 

Rat CoV 
PHEV 
AIBV 
SARS 



I I 

845 

YVS NG 

YAS NG 

YHS ND 

YYSI YNY 

YYSI YNY 

YYSI YNY 

YYSI YNY 

YCVD— YSK 

YCVD YST 

LCVN YST 

LCVN YSI 

YCVD YVT 

NGTRRFRRSI 
IPIG AGI 



I 



I 



I 



855 865 

GNN CTTAV 

TYN CTDAV 

GSN CTEPV 

TNDRTRGTAI DSNDVDCEPV 
TNVMNRGTAI D-NDIDCEPI 
TSERTRGTAI DSNDVDCEPV 
TNDKTRGTPI GSNDVDCEPV 

NRR SRGAI 

KRR SRRAI 

SHR ARSSV 

AHR ARRSV 

ALR SRRSF 

TEN VANCPY 

CAS YHTVSL 



I 



875 
MIYSNFGICA 
LTYSSFGVCA 
LVYSNIGVCK 
ITYSNIGVCK 
ITYSNIGVCK 
ITYSNIGVCK 
ITYSNIGVCK 
TTGYRFTNFE 
TTGYRFTNFE 
STGYKLTTFE 
STGYKLTTFE 
TTGYRFTNFE 
VSYGKFCIKP 
LRSTSQKSIV 



....I.... I 

885 
DGSLIPVRPR 
DGSIIAVQPR 
SGSIGYV-PS 
NGAFVFIN-V 
NGAI*VFIN-V 
NGALVFIN-V 
NGALVFIN-V 
PFTVNSVN — 
PFTVNSVN — 
PFTVRIVN— 
PFTVSIVN — 
PFAANLVN — 
DGSIATIVPK 
AYTMSLG 



J I 

895 
NSSDNGISAI 
NVSYDSVSAI 
QY6QVKIAPT 
THSDGDVQPI 
THSDGDVQPI 
THSDGDVQPI 
THSDGDVQPI 

DSLEPVG 

DSLEPVG 

DSVESVD 

DSVESVG 

DSIEPVG 

QLEQFVAPLF 
ADSSIAY 



I 1 

905 
-ITANLSIPS 
-VTANLSIPS 
-VTGNISIPT 
-STGNVTIPT 
-STGNVTIPT 
-STGNVTIPT 
-STGNVTIPT 
-GLYEIQIPS 
-GLYEIQIPS 
-GLYELQIPT 
-GLYEMQIPT 
-GLYEIQIPS 
NVTENVLIPN 
-SNNTIAIPT 

I I 



965 
LRLSAHLETN 
LRNSARLESA 
LQLSTIRLESV 
LAMGARLENM 
LAMGARLENM 
LAMGARLENM 
LAMGARLENM 
LTEVNELLDT 
LTEVNELLDT 
LGEVNNLIDT 
LGEVNNLIDT 
LTEVNELLDT 
VNSVGQKEDM 
LSGIAAEQDR 



1 I 

915 
NWTTSVQVEY 
NWTTSVQVEY 
NFSMSIRTEY 
NFTISVQVEY 
NFTISVQVEY 
NFTISVQVEY 
NFTISVQVEY 
EFTIGNMEEF 
EFTIGNMEEF 
NFTIASHQEF 
NFTIASHQEF 
EFTIGNLEEF 
SFNLTVTDEY 
NFSISITTEV 

I 



I I 

925 
LQITSTPIW 
LQITSTPIVV 
LQLYNTPVSV 
IQVYTTPVSI 
IQVYTTPVSI 
MQVYTTPVSI 
IQVYTTPVSI 
IQTSSPKVTI 
IQTSSPKVTI 
VQTRSPKVTI 
IQTRSPKVTI 
IQTRSPKVTI 
IQTRMDKVQI 
MPVSMAKTSV 



I I 

1025 

RNIHSS 

LPTSGS 

V— YDPASGR 
LKYILPSHNS 
LKDILPSHNS 
LKDILPSHNS 
LKYILPSDNS 
-SECSKASS- 
-SACNKVSS- 
-SDCGEVTMA 
-SDCSEGTKA 
-SECNRAST- 

PSSRR 

LKPTK- 

I I 



975 
DVSSMLTFDS 
DVSEMLTFDK 
EVNSMLTISE 
EVDSMLFVSE 
EIDSMLFVSE 
EVDSMLFVSE 
EVDSMLFVSE 
TQLQVANSLM 
TQLQVANSLM 
MQLQVASALI 
MQLQVASALI 
TQLQVANSLM 
ELLNFYSSTK 
NTREVFAQVK 

1 I 



I I 

985 
NA-FSLANVT 
KA-FTLANVS 
EA-LQLATIS 
NA-LKLASVE 
NA-LKLASVE 
NA-LKLASVE 
NA-LKLASVE 
NG-VTLSTKL 
NG-VTLSTKL 
QG-VTLSSRL 
QG-VTLSSRL 
NG-VTLSTKI 
PAGFNTPVLS 
QM-YKTPTLK 



1 I 

935 
DCATYVCNGN 
DCSTYVCNGN 
DCATYVCNGN 
DCSRYVCNGN 
DCARYVCNGN 
DCARYVCNGN 
DCSRYVCNGN 
DCAAFVCGDY 
DCSAFVCGDY 
DCAAFVCGGH 
DCAAFVCGDY 
DCATFVCGDY 
NCLQYVCGSS 
DCNMYICGDS 

I I 



995 

SFG D 

SFG D 

SFNG DG 

AFN SS 

AFN ST 

AFN ST 

AFN SS 

KDGVNFNVDD 
KDGVNFNVDD 
SDGIGGQIDD 
ADGISGQIDD 
KDGINFNVDD 

NVSTG E 

YFG G 



I 1 

945 
PRCKNLLKQY 
VRCVELLKQY 
SRCKQLLTQY 
PRCNKLLTQY 
PRCNKLLTQY 
PRCNKLLTQY 
PRCNKLLTQY 
AACKSQLVEY 
AACKSQLVEY 
TACRQQLVEY 
TACRQQLVDY 
AACRQQLAEY 
LDCRKLFQQY 
TECANLLLQY 

I I 



955 
TSACKTIEDA 
TSACKTIEDA 
TAACKTIESA 
VSACQTIEQA 
VSACQTIEQA 
VSACQTIEQA 
VSACQTIEQA 
GSFCDNINAI 
GSFCDNINAI 
GSFCDNINAI 
GSFCDNINAI 
GSFCENINAI 
GPVCDNILSV 
GSFCTQLNRA 



1035 
RIAGRSALED 
RVAGRSAIED 
VVQKRSVIED 
KRKYR5AIED 
KRKYRSAIED 
KRKYGSAIED 
KRKYRSAIED 

RSAIED 

RSAIED 

AQTGRSAIED 
AQ-GRSAIED 

RSAIED 

K RSLIED 

RSFIED 



1085 
IMVLPGVADA 
IMVLPGVADA 
VMVLPGWDA 
IMVLPGVANA 
IMVLPGVAND 
IMVLPGVANA 
IMVLPGVANA 
IKVLPPLLSE 
IKVLPPLLSV 
IKVLPPVLSE 
IKVLPPVLSE 
IKVLPPLLSE 
LLVLPPIITA 
LTVLPPLLTD 



I I 

1095 
ERMAMYTGSL 
ERMAMYTGSL 
EKLHMYSASL 
DKMTMYTASL 
DKMTMYTASL 
DKMTMYTASL 
DKMTMYTASL 
NQISGYTLAA 
NQISGYTLAA 
NQISGYTAGA 
SQISGYTAGA 
NQISGYTLAA 
EMQALYTSSL 
DMIAAYTAAL 



1045 
LLFSKWTSG 
ILFSKLVTSG 
LLFNKWTNG 
LLFDKWTSG 
LLFDKVVTSG 
LLFDKWTSG 
LLFSKWTSG 
LLFDKVKLSD 
LLFSKVKLSD 
VLFDKVKLSD 
VLFDKVKLSD 
LLFDKVKLSD 
LLPTSVESVG 
LLFNKVTLAD 

I 



1105 
IGGMVLGGLT 
IGGIALGGLT 
IGGMALGGIT 
AGGITLGALG 
TGGITLGALS 
AGGITLGALG 
AGGITLGALG 
TSASLFPLWT 
TSASLFPPLS 
TVSAMFP-WS 
TASAMFPPWS 
TAASLFPPWT 
VASMAFGGIT 
VSGTATAGWT 



1055 
LGTVDVDYKS 
LGTVDADYKK 
LGTVDEDYKR 
LGTVDEDYKR 
LGTVDEDYKR 
LGTVDEDYKR 
LGTVDEDYKR 
VG-FVEAYNN 
VG-FVEAYNN 
VG-FVEAYNN 
VG-FVESYNN 
VG-FVQAYNN 
LP-TNDAYKN 
AG-FMKQYGE 

I I 



1005 
YNLSSVLPQ- 
YNLSSVIPS- 
YNFTNVLGAS 
ETLDPIYKEW 
ENLDPIYKEW 
ENLDPIYKEW 
ETLDPIYKEW 
INFSPVLGCL 
INFSPVLGCL 
INFSPLLGCL 
INFSPLLGCL 
INFSPVLGCL 
FNISLLLTN- 
FNFSQILPDP 

I I 



1 



1015 



PNIGGSWLEG 
PNIGGSWLGG 
PSIGGSWLGG 
PNIGGFHLEG 

G 

G 

G 

G 

G 



1065 
CTKGLS— lA 
CTKGLS— lA 
CSNGRS— VA 
CTGGYD — lA 
SAGGYD— lA 
CTGGYD — lA 
CTGGYD — lA 
CTGGAE — IR 
CTGGAE — IR 
CTGGQE — VR 
CTGGQE — VR 
CTGGAE — IR 
CTAGPLGFFK 
CLGDIN — AR 



I I 

1075 
DLACAQYYNG 
DLACAQYYNG 
DLVCAQYYSG 
DLVCAQYYNG 
DLVCARYYNG 
DLVCAQYYNG 
DLVCAQYYNG 
DLICVQSYKG 
DLICVQSYNG 
DLLCVQSFNG 
DLLCVQSFNG 
DLICVQSYNG 
DLACAREYNG 
DLICAQKFNG 



1115 

S AAAIP 

-AVSIP 
-AAALP 
-AVAIP 
-AVAIP 
-AVAIP 
-AVAIP 
-AAGVP 
-AVGVP 
-AAGVP 
-AAGVP 
-AAGVP 
-AGAIP 



FGAGAALQIP 



I I 

1125 
FSLALQARLN 
FSLAIQARLN 
FSYAVQARLN 
FAVAVQARLN 
FAVAVQARLN 
FAVAVQARLN 
FAVAVQARLN 
FYLNVQYRIN 
FYLNVQYRIN 
FSLSVQYRIN 
FALSVQYRIN 
FYLNVQYRIN 
FATQLQARIN 
FAMQMAYRFN 



I I 

1135 
YVALQTDVLQ 
YVALQTDVLQ 
YLALQTDVLQ 
YVALQTDVLN 
YVALQTDVLN 
YVALQTDVLN 
YVALQTDVLN 
GLGVTMDVLS 
GIGVTMDVLS 
GLGVTMNVLS 
GLGVTMNVLS 
GLGVTMDVLS 
HLGITQSLLL 
GIGVTQNVLY 
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I I I I I I I I I I I I 

1145 1155 1165 1175 1185 1195 

EMCR S ENQKILAASF NKAINNIVAS FSSVNDAITH TAEAIHTVTI ALNKIQDVVN QQGSALNHLT 

229E S ENQKILAASF NKAMTNIVDA FTGVNDAITQ TSQALQTVAT ALNKIQDVVN QQGNSLNHLT 

PEDV RNQQLIAESF NSAIGNITSA FESVKEAISQ TSKGLNTVAH ALTKVQEVVN SQGSALNQLT 

TGEV KNQQILASAF NQAIGNITQS FGKVNDAIHQ TSRGLATVAK ALAKVQDVVN IQGQALSHLT 

CaCoV KNQQILANAF NQAIGNITQA FGKVNDAIHQ TSKGLATVAK ALAKVQDVVN TQGQALSHLT 

FeCoV KNQQILANAF NQAIGNITQA FGKVNDAIHQ TSQGLATVAK ALAKVQDVVN TQGQALSHLT 

For Resp C KNQQILASAF NQAIGNITQS FGKVNDAIHQ TSRGLTTVAK ALAKVQDVVN TQGQALRHLT 

OC43 QNQKLIANAF NNALYAIQEG FDATN S ALVKIQAVVN ANAEALNNLL 

BOCOV QNQKLIANAF NNALDAIQEG FDATN S ALVKIQAVVN ANAEALNNLL 

MHV ENQKMIASAF NNAIGAIQEG FAATN S ALAKMQFWN ANAEALNNLL 

Rat CoV ENQKMIASSF NNAIGAIQEG FDATN S ALAKIQSWN ANAEALNNLL 

PHEV QNQKLIASAF NNALDAIQEG FDATN S ALVKIQAVVN ANAEALNNLL 

AIBV KNQEKIAASF NKAIGHMQEG FRSTS L ALQQIQDWS KQSAILTETM 

SARS ENQKQIANQF NKAISQIQES LTTTS T ALGKLQDWN QNAQALNTLV 

I I I I I I I I I I I.... I 

1205 1215 1225 1235 1245 1255 

EMCR S SQLRHNFQAI SNSIHAIYDR LDSIQADQQV DRLITGRLAA LNAFVSQVLN KYTEVRGSRR 

229E S SQLRQNFQAI SSSIQAIYDR LDTIQADQQV DRLITGRLAA LNVFVSHTLT KYTEVRASRQ 

PEDV VQLQHNFQAI SSSIDDIYSR LDILSADVQV DRLITGRLSA LNAFVAQTLT KYTEVQASRK 

TGEV VQLQNNFQAI SSSISDIYNR LDELSADAQV DRLITGRLTA LNAFVSQTLT RQAEVRASRQ 

CaCoV VQLQNNFQAI SSSISDIYNR LDELSADAQV DRLITGRLTA LNAFVSQTLT RQAEVRASRQ 

FeCoV VQLQNNFQAI SSSISDIYNR LDELSADAQV DRLITGRLTA LNAFVSQTLT RQAEVRASRQ 

Por Resp C VQLQNNFQAI SSSISDIYNR LDELSADAQV DRLITGRLTA LNAFVSQTLT RQAEVRASRQ 

OC43 QQLSNRFGAI SASLQEILSR LDALEAEAQI DRLINGRLTA LNAYVSQQLS DSTLVKFSAA 

BoCoV QQLSNRFGAI SSSLQEILSR LDALEAQAQI DRLINGRLTA LNVYVSQQLS DSTLVKFSAA 

MHV NQLSNRFGAI SASLQEILSR LDALEAQAQI DRLINGRLTA LNAYVSKQLS DMTLVKVSAA 

Rat CoV NQLSNRFGAI SASLQEILSR LDALEAQAQI DRLINGRLTA LNAYVSKQLS DMTLIKVSAA 

PHEV QQLSNRFGAI SASLQEILSR LDALEAKAQI DRLINGRLTA LNAYVSQQLS DSTLVKFSAA 

AIBV ASLNKNFGAI SSVIQEIYQQ FDAIQANAQV DRLITGRLSS LSVLASAKQA EYIRVSQQRE 

SARS KQLSSNFGAI SSVLNDILSR LDKVBAEVQI DRLITGRLQS LQTYVTQQLI RAAEIRASAN 



EMCR S 

229E S 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat Gov 
PHEV 
AIBV 
SARS 



I I 

1265 
LAQQKINECV 
LAQQKVNECV 
LAQQKVNECV 
LAKDKVNECV 
LAKDKVNECV 
LAKDKVNECV 
LAKDKVNECV 
QAMEKVNECV 
QAMEKVNECV 
QAIEKVNECV 
QAIEKVNECV 
QAIEKVNECV 
LATQKINECV 
LAATKMSECV 



I 



1 



I 



1275 
KSQSNRYGFC 
KSQSKRYGFC 
KSQSQRYGFC 
RSQSQRFGFC 
RSQSQRFGFC 
RSQSQRFGFC 
RSQSQRFGFC 
KSQSSRINFC 
KSQSSRINFC 
KSQSSRINFC 
KSQSPRINFC 
KSQSSRINFC 
KSQSIRYSFC 
LGQSKRVDFC 



1285 
G-NGTHIFSI 
G-NGTHIFSI 
GGDGEHIFSL 
G-NGTHLFSL 
G-NGTHLFSL 
G-NGTHLFSL 
G-NGTHLFSL 
G-NGNHIISL 
G-NGNHIISL 
G-NGNHILSL 
G-NGNHILSL 
G-NGNHIISL 
G-NGRHVLTI 
G-KGYHLMSF 



1295 
VNSAPDGLLF 
VNAAPEGLVF 
VQAAPQGLLF 
ANAAPNGMIF 
ANAAPNGMIF 
ANAAPNGMIF 
ANTU^PNGMIF 
VQNAPYGLYF 
VQNAPYGLYF 
VQNAPYGLYF 
VQNAPYGLYF 
VQNAPYGLYF 
PQNAPNGIVF 
PQAAPHGWF 



1305 
LHTVLLPTDY 
LHTVLLPTQY 
LHTVLVPGDF 
FHTVLLPTAY 
FHTVLLPTAY 
FHTVLLPTAY 
FHTVLLPTAY 
IHFSYVPTKY 
IHFSYVPTKY 
IHFSYVPTSF 
IHFSYVPTSF 
IHFSYVPTKY 
IHFSYTPDSF 
LHVTYVPSQE 



I I 

1315 
KNVKAWSGIC 
KDVEAWSGLC 
VNVLAIAGLC 
ETVTAWPGIC 
ETVTAWSGIC 
ETVTAWSGIC 
ETVTAWSGIC 
VTARVSPGLC 
VTAKVSPGLC 
TTANVSPGLC 
TTVNVSPGLC 
VTAKVSPGLC 
VNVTAIVGFC 
RNFTTAPAIC 



EMCR S 

229E S 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat CoV 
PHEV 
AIBV 
SARS 



I I 

1325 

VDG lYG 

VDG TNG 

VNG EIA 

ASDG-DRTFG 
ASDG-SRTFG 
ASDG-DRTFG 
ALDV-DRTFG 

lAG DRG 

lAG DRG 

ISG DRG 

ISG DRG 

I AG DIG 

VKPANASQYA 
HEG KA 



I 



1335 
YVLRQPNLVL 
YVLRQPNLAL 
LTLREPGLVL 
LVVKDVQLTL 
LWEDVQLTL 
LVVKDVQLTL 
LVVKDVQLTL 
lAPKSGYFVN 
lAPKSGYFVN 
LAPKAGYFVQ 
LAPKAGYFVQ 
ISPKSGYFIN 
IVPANGRGIF 
YFPREGVFVF 



1345 

YS DN 

YK EG 

FTHELQTYTA 

FRN LD 

FRN LD 

FRN LD 

FRN LD 

VN 

VN 

DD 

DH 

VN 

IQVN 

NG 



I I 

1355 
GVFRVTSRVM 
NYYRITSRIM 
TEYFVSSRRM 
DKFYLTPRTM 
EKFYLTPRTM 
DKFYLTPRTM 
DKFYLTPRTM 
NTWMYTGSGY 
NTWMFTGSGY 
GEWKFTGSNY 
GEWKFTGSNY 
NSWMFTGSSY 
GSYYITARDM 
TSWFITQRNF 



I I 

1365 
FQPRLPVLSD 
FEPRIPTMAD 
FEPRKPTVSD 
YQPRVATSSD 
YQPRVATSSD 
YQPRVATSSD 
YQPRVATSSD 
YYPEPITENN 
YYPEPITGNN 
YYPEPITDKN 
YYPESITDKN 
YYPEPITQNN 
YMPRAITAGD 
FSPQIITTDN 



1 I 

1375 
FVQIYNCNVT 
FVQIENCNVT 
FVQIESCVVT 
FVQIEGCDVL 
FVQIEGCDVL 
FVQIEGCDVL 
FVQIEGCDVL 
VVVMSTCAVN 
VWMSTCAVN 
SVVMSSCAAN 
SWMSSCAVN 
VVVMSTCAVN 
WTLTSCQAN 
TFVSGNCDW 



EMCR S 
229E S 
PEDV 
TGEV 

cacov 
FeCov 

For Resp C 
OC43 
BoCoV 
MHV 

Rat CoV 
PHEV 
AIBV 
SARS 



I I 

1385 
FVNISRVELH 
FVNISRSELQ 
YVNLTSDQLP 
FVNATVSDLP 
FVNGTVIELP 
FVNATVIDLP 
FVNTTVSDLP 
YTKAPYVMLN 
YTKAPDVMLN 
YTKAPEVFLN 
YTKAPEVFLN 
YTKAPDLMLN 
YVSVNKTVIT 
IGIINNTVYD 



.1 



1395 
TVIP-DYVDV 
TIVP-EYIDV 
DVIP-DYIDV 
SIIP-DYIDI 
SIIP-DYIDI 
SIIP-DYIDI 
SIIP-DYIDI 
TSIP-NLPDF 
ISTP-NLHDF 
TSIP-NLPDF 
TSIT-NLPDF 
TSTP-NLPDF 
TFVDNDDFDF 
PLQP-ELDSF 



I I 

1405 
NKTLQEFAQN 
NKTLQELSYK 
NKTLDEILAS 
NQTVQDILEN 
NQTVQDILEN 
NQTVQDILEN 
NQTVQDILEN 
KEELDQWFKN 
KEELDQWFKN 
KEELDKWFKN 
KEELDKWFKN 
KEELYQWFKN 
NDELSKWWND 
KEELDKYFKN 



I I 

1415 
L-PKYVKPNF 
L-PNYTVPDL 
L-PNRTGPSL 
FRPNWTVPEL 
FRPNWTVPEL 
YRPNWTVPEF 
FRPNWTVPEL 
QTSVAPDLSL 
QTSVAPDLSL 
QTSIAPDLSL 
QTSIVPDLSF 
QSSVAPDLSL 
T — KHELPDF 
HTSPDVDLGD 



I I 

1425 
DLTPFNLTYL 
WEQYNQTIL 
PLDVFNATYL 
TFDIFNATYL 
PLDIFHATYL 
TLDIFNATYL 
TLDVFNATYL 
DY— INVTFL 
DY— INVTFL 
DFEKLNVTLL 
DIGKLNVTFL 
DY — INVTFL 
DKFNYTVPIL 
ISG-INASVV 



I I 

1435 
NLSSELKQLE 
NLTSEISTLE 
NLTGEIADLE 
NLTGEIDDLE 
NLTGEINDLE 
NLTGEIDDLE 
NLTGEIDDLE 

DLQVEMN 

DLQDEMN 

DLTDEMN 

DLSYEMN 

DLQDEMN 

DIDSEID 

NIQKEID 
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I 1 1 I I I I I I I I I 

1445 1455 1465 1475 1485 1495 

EMCR S AKTASLFQTT VELQGLIDQI NSTYVDLKLL NRFENYIKWP WWVWLIISW FWLLSLLVF 

229E S NKSAELNYTV QKLQTLIDNI NSTLVDLKWL NRVETYIKWP WWVWLCISVV LIFWSMLLL 

PEDV QRSESLRNTT EELRSHNNI NNTLVDLEWL NRVETYIKWP WWVWLIIVIV LIFVVSLLVF 

TGEV FRSEKLHNTT VELAILIDNI NNTLVNLEWL NRIETYVKWP WYVWLLIGLV VIFCIPLLLF 

CaCoV FRSEKLHNTT VELAILIDNI NNTLVNLEWL NRIETYVKWP WYVWLLIGLV VIFCIPILLF 

FeCoV FRSEKLHNTT VELAILIDNI NNTLVNLEWL NRIETYVKWP WYVWLLIGLV VVFCIPLLLF 

Por Resp C FRSEKLHNTT VELAILIDNI NNTWNLEWL NRIETYVKWP WYVWLLIGLV VIFCIPLLLF 

OC43 -RLQEAIKVL NQSYINLKDI GTYEYYVKWP WYVWLLICLA GVAMLVLLFF 

BoCoV -RLQEAIKVL NQSYINLKDI GTYEYYVKWP WYVWLLIGFA GVAMLVLLFF 

MHV RIQDAIKKL NESYINLKDV GTYEMYVKWP WYVWLLIGLA GVAVCVLLFF 

Rat Gov RIQDAIKNL NBSYINLKEI GTYEMYVKWP WYVWLLIGLA GVAVCVLLFF 

PHEV RLQEAIKVL NQSYINLKDI GTYEYYVKWP WYVWLLIGLA GVAMLVLLFF 

AIBV RIQGVIQGL NDSLIDLEKL SILKTYIKWP WYVWLAIAFA TIIFILILGW 

SARS -RLNEVAKNL NESLIDLQEL GKYEQYIKWP WYVWLGFIAG LIAIVMVTIL 

I I I I I 1 I I I 

1505 1515 1525 1535 1545 

EMCR S CCLSTGCCGC CNCLTSSMRG CCDCGSTKLP YYEFEKVHVQ 

229E S CCCSTGCCGF FSCFASSIRG CCES — TKLP YYDVEKIHIQ 

PEDV CCISTGCCGC CGCCGACFSG CCRG-PRLQP YEAFEKVHVQ 

TGEV CCCSTGCCGC IGCLGSCCHS ICSR-RQFEN YEPIEKVHVH 

CaCoV CCCSTGCCGC IGCLGSCCHS ICSR-GQFES YEPIEKVHVH 

FeCoV CCFSTGCCGC IGCLGSCCHS ICSR-RQFEN YEPIEKVHVH 

Por Resp C CCCSTGCCGC IGCLGSCCHS IFSR-RQFEN YEPIEKVHVH 

OC4 3 ICCCTG-CG- -TSCFKKCGG CCDDYTGYQE LVIKT SH DD 

BoCoV ICCCTG-CG TSCFKICGG CCDDYTGHQE LVIKT SH DD 

MHV ICCCTG-CG SCCFKKCGN CCDECGGHQD SIVIHNISSH ED 

Rat COV ICCCTG-CG SCCFKKCGN CCDEYGGRQA GIVIHNISSH ED 

PHEV ICCCTG-CG TSCFKKCGG CCDDYTGHQE FVIKT SH DD 

AIBV VFFMTGCCGC CCGCFGIMPL MSKCGKKSSY YTTFDNDWT EQYRPKKSV 

SARS LCCMTSCCS CLKGACSCG SCCKFDEDDS EPVLKGVKLH YT 



f. Putative Orf 4a 

— I — I — I — 1 — I — I — I — I — I — I — I — I 

5 15 25 35 45 55 

EMCR 4a MPFGGLFQLT LESTINKSVA NLKLPPHDVT VLRDNLKPVT TLSTITAYLL VSLFVTYFAL 

229E 4a MALG-LFTLQ LVSAVNQSLS NAKVSAEVSR QVIQDVKDGT VTFNLLAYTL MSLFWYFAL 

\ I I I I I I I I I I I 

65 75 85 95 105 115 

EMCR 4a FKPLTARGRV ACFVLKLLTL SVYVPLLVLF GMYLDSFIIF FLRCCFDSYM LAIMPISNKN 

229E 4a FKARSHRGRA ALIVFKILIL FVYVPLLYWS QAYIYATLIA VILLG-RFFH TAWHCWLYKT 

I I I I I I I I I I I I 

125 135 145 155 165 175 
EMCR 4a FSFVLFNVTK LCFVSGKCWY LEQSFYENRF AAIYGGDHYV VLGGETITFV SFDDLYVAIR 
229E 4a WDFIVFNVTT LCYAR 

I I I I I I I I I. 

185 195 205 215 225 
EMCR 4a GSCEKNLQLM RKVDLYNGAV lYIFAEEPW GIVYSSQLYE DVPSIN 
229E 4a 
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Putative Orf 4ab 



EMCR 4a 
229E 4a 
229E 4b 



EMCR 4a 
229E 4a 
229E 4b 



EMCR 4 a 
229E 4a 
229E 4b 



EMCR 4a 
229E 4a 
229E 4b 



....I. ...I ....I.... I ..I ..I ,.1 

5 15 25 35 45 55 

MPFGGLFQLT LESTINKSVA NLKIiPPHDVT VLRDNLKPVT TLSTITAYLL VSLFVTYFAL 
MALG-LFTLQ LVSAVNQSLS NAKVSAEVSR QVIQDVKDGT VTFNLLAYTL MSLFVVYFAL 

....|..,.| ....I.,.. I I ....I.. ..I ....! 1 ....I.. ..I 

65 75 85 95 105 115 

FKPLTARGRV ACFVLKLLTL SVYVPLLVLF GMYLDSFIIF FLRCCFDSYM LAIMPISNKN 

FKARSHRGRA ALIVFKILIL FVYVPLLYWS QAYIYATLIA VILLG-RFFH TAWHCWLYKT 

I ....I. ...I ....I.... I ....I I I.. ..I 

125 135 145 155 165 175 

FSFVLFNVTK LCFVSGKCWY LEQSFYENRF AAIYGGDHYV VLGGETITFV SFDDLYVAIR 

WDFIVFNVTT LCYAR 

MQGKCW FLENKALKPF VCFYGGDQFL YIGDRIVSYF STNDLYVALR 

I I I 1 I I I I I . 

185 195 205 215 225 

GSCEKNLQLM RKVDLYNGAV lYIFAEEPW GIVYSSQLYE DVPSIN 

GRIDKDLSLS RKVELYNGEC VYLFCEHPAV GIVHTDFKLE IH . . . . 



h. 



Putative Orf E 



I 



EMCR E 

229E 

PEDV 

TGBV 

CaCoV 

FeCoV 

Por Resp C 

OC43 

BoCoV 

PHEV 

MHV 

Rat Gov 

AIBV 

SARS 



EMCR E 

229E 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 

OC43 

BoCoV 

PHEV 

MHV 

Rat COV 

AIBV 

SARS 



MFLRLI 

MFLKLV 

MLQLV 

MTFPRALTVI 
MTFPRALTVI 
MTFPRAFTII 
MTFPRALTVI 
— MFMADAYL 
— MFMADAYF 
— MFMADAYL 

MFNLFL 

MFNLFL 

— MNLLNKSL 
MYSFVS 



I I 

15 

DDNG-IVLNS 
DDHA-LVVNV 
NDNG-LWNV 
DDNG-MVINI 
DDNG-MVISI 
DDHG-MWSV 
DDNG-MVISI 
ADTV-WYVGQ 
ADTV-WYVGQ 
ADTV-WYVGQ 
TDTV-WYVGQ 
IDTV-WYVGQ 
EENG-SFLTA 
EETGTLIVNS 



I I 

25 

ILWLLVMIFF 
LLWCWLIVI 
ILWLFVLFFL 
IFWFLLIIIL 
IFWFLLIIIL 
FFWLLLIIIL 
IFWFLLIIIL 
IIFIVAICLL 
IIFIVAICLL 
IIFIVAICLL 
IIFIVAVCLM 
IIFIVAVCLM 
LYIIVGFLAL 
VLLFLAFWF 



I I 

65 

VYKIFL- 

IKNVYH- 

IGRLYR- 

AQHAYD- 

AIUIAYD- 

ARHAYD- 

VQHAYD- 

SIYVFNR 

SIYVFNR 

SIYVFNR 

SICVYNR 

SIYVYNR 

AKGTAFVYKY 
TVYVYS- 



.1-. 
75 



I 



GR- 
GR- 
GR- 
SK- 
SK- 



..I I 

85 

--AYQDYM 
— lYQSYM 
— VYKSYM 
— AYKNFM 
— AYKNFM 
— AYKTFM 
—AYKNFM 
-QFYEFYN 
-QFYEFYN 
-QFYEFYN 
-QLYKYYN 
-QLYKYYN 



I 1 

35 

F-VLAMTFIK 
L-LVCITIIK 
L-IISITFVQ 
I-LLSIALLN 
I-LFSIALLN 
I-LFSIALLN 
I-LLSIALLN 
VTIVVVAFLA 
VIIVVVAFLA 
VIIVWAFLA 
VTIIWAFLA 
VTIIWAFLA 
Y-LLGRALQA 
L-LVTLAILT 



TYGRKLNNPE LEAVIVNEFP 
RVKNLN 



I I 

95 

— QIAPV-PA 
— HIDPF-PK 
~RIDPL-PS 
— RIKAYNPD 
— QIRAYNPD 
— QTKAYNPD 
—RIKAYNPD 
~DVKPP-VL 
~DVKPP-VL 
— DVKPP-VL 
E-EVRPP-PL 
E-EVRPP-PL 
KNGWNNKNPA 
— SSEGV-PD 



I 1 

45 

LIQLCFTCHY 
LIKLCFTCHM 
LVNLCFTCHR 
IIKLCMVCCN 
IIKLCMVCCN 
VIKLCMVCCN 
IIKLCMVCCN 
TFKLCIQLCG 
TFKLCIQLCG 
TFKLCIQLCG 
SIKLCIQLCG 
SIKLCIQLCG 
FVQAADACCL 
ALRLCAYCCN 



I I 

55 

FFSRTLYQP- 
FCNRTVYGP- 
LCNSAVYTP- 
LGRTVIIVP- 
LGRTVIIVP- 
LGKTIIVLP- 
LGRTVIIVP- 
MCNTLVLSP- 
MCNTLVLSP- 
MCNTLVLSP- 
LCNTLLLSP- 
LCNTLLLSP- 
FWYTWWIPG 
IVNVSLVKP- 



I.. 

105 

EVLNV — 

RVIDF 

TVIDV 

GALLA — 

EALLV — 

EAFLV 

GALLV — 

DVDDV — 

DVDDV — 

DVDDV — 

EVDDIIIQTL — 
EVDDIIIQTL — 
NFQDAQRDKL YS 
LLV — 
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1 , Putative Orf M (Matrix protein) 

I I I I !....( 1 I I 1 I I 

5 15 25 35 45 55 

EMCR M SNSS 

229E M SNDN 

PEDV M SNGS 

TGEV MK ILLILACVIA CACGERYCAM KSDTDLSCRN 

CaCoV MKK ILFLIACAIA CVYGERYCAM TESS-TSCRN 

FeCoV MHMMPIRPLC KPRHIIPTKH FHFELNKMKY ILLILACIIA CVYGERYCAM QDSG-LQCIN 

PRCOV MK ILLILACAIA CTCGERYCM KDDTGLSCRN 

OC43 M SSKT 

PHEV M SSPT 

BoCoV M SSVT 

MHV M TSTTQ 

RatSAV M SSTTP 

AIBV M PNETN 

SARS M ADNG 

I I I I I I I I I I I I 

65 75 85 95 105 115 

EMCR V PLSEVYVHLR NWNFSWNLIL TVFIVVLQYG HYKYSRLLYG LKMSVLWCLW 

229E C -TGDIVTHLK NWNFGWNVIL TIFIVILQFG HYKYSRLFYG LKMLVLWLLW 

PEDV I PVDEVIEHLR NWNFTWNIIL TILLWLQYG HYKYSVFLYG VKMAILWILW 

TGEV STASOCESCF NGGDLIWHLA NWNFSWSIIL IVFITVLQYG RPQFSWFVYG IKMLIMWLLW 

CaCoV STAGNCASCF ETGDLIWHLA NWNFSWSVIL IIFITVLQYG RPQFSWFVCG IKMLIMWLLW 

FeCoV GTNSRCQTCF ERGDLIWHLA NWNFSWSVIL IVFITVLQYG RPQFSWLVYG IKMLIMWLLW 

PRCoV GTASDCESCF NRGDLIWLLA NWNFSWSIIL IIFITVLQYG RPQFSWFVYG IKMLIMWLLW 

OC43 — TPAPVYIW TADEAIKFLK EWNFSLGIIL LFITIILQFG YTSRSMFVYV IKMIILWLMW 

PHEV — TPVPVISW TADEAIKFLK EWNFSLGIIV LFITIILQFG YTSRSMFVYV IKMVILWLMW 

BoCoV — TPAPVYTW TADEAIKFLK EWNFSLGIIL LFITIILQFG YTSRSMFVYV IKMIILWLMW 

MHV — APQPVYQW TADEAIRFLK EWNFSLGIIL LFVTIILQFG YTSRSMFVYV VKMILLWLMW 

RatSAV — APQTVYQW TADVAVRFLK EWNFLLGIIL LFITIILQFG YTSRSMFIYV VKMIILWLMW 

AIBV CTL DFEQSVQLFK EYNLFITAFL LFLTIILQYG YATRSKVIYT LKMIVLWCFW 

SARS TI TVEELKQLLE QWNLVIGFLF LAWIMLLQFA YSNRNRFLYI IKLVFLWLLW 

I I I I I I I I I I I I 

125 135 145 155 165 175 

EMCR PLVLALSIFD CFVNFNVD-W VFFGFSILMS IITLCLWVMY FVNSFRLWRR VKTFWAFNPE 

229E PLVLALSIFD TWANWDSN-W AFVAFSFFMA VSTLVMWVMY FANSFRLFRR ARTFWAWNPE 

PEDV PLVLALSLFD AWASFQVN-W VFFAFSILMA CITLMLWIMY FVNSIRLWRR THSWWSFNPE 

TGEV PWLALTIFN AYSEYQVSRY VMFGFSIAGA IVTFVLWIMY FVRSIQLYRR TKSWWSFNPB 

CaCoV PIVLALTIFN AYLEYRVSRY VMFGFSVAGA TVTFILWIMY FVRSIQLYRR TKSWWSFNPE 

FeCoV PIVLALTIFN AYSEYQVSRY VMFGFSVAGA WTFALWMMY FVRSVQLYRR TKSWWSFNPB 

PRCoV PIVLALTIFN AYSEYQVSRY VMFGFSIAGA IVTFVLWIMY FVRSIQLYRR TKSWWSFNPB 

OC4 3 PLTIILTIFN — CVYALN-N VYLGLSIVFT IVAIIMWIVY FVNSIRLFIR TGSFWSFNPE 

PHEV PLTIILTIFN — CVYALN-N VYLGFSIVFT IVAIIMWWY FVNSIRLFIR TGSWWSFNPE 

BoCoV PLTIILTIFN — CVYALN-N VYLGFSIVFT IVAIIMWIVY FVNSIRLFIR TGSWWSFNPE 

MHV PLTIVLCIFN — CVYALN-N VYLGFSIVFT IVSIIMWIMY FVNSIRLFIR TGSWWSFNPE 

RatSAV PLTIVLCIFN — CVYALN-N VYLGFSIVFT IVSIVMWIMY FVNSIRLFIR TGSWWSFNPE 

AIBV PLNIAVGVIS — CTYPPN-T GGLVAAIILT VFACLSFVGY WIQSIRLFKR CRSWWSFNFB 

SARS PVTLACFVLA — AVYRIN-W VTGGIAIAMA CIVGLMWLSY FVASFRLFAR TRSMWSFNPB 

I I I I I I I I I I I I 

185 195 205 215 225 235 

EMCR TNAIISLQVY -GHNYYLPVM AAPTGVTLTL LSGVLLVDGH KIATRVQVGQ LPKYVIVATP 

22 9E VNAITVTTVL -GQTYYQPIQ QAPTGITVTL LSGVLYVDGH RLASGVQVHN LPEYMTVAVP 

PEDV TDALLTTSVM -GRQVCIPVL GAPTGVTLTL LSGTLLVEGY KVATGVQVSQ LPNFVTVAKA 

TGEV TKAILCVSAL -GRSYVLPLE GVPTGVTLTL LSGNLYAEGF KIAGGMNIDN LPKYVMVALP 

CaCoV TSAILCVSAL -GRSYVLPLE GVPTGVTLTL LSGNLCAEGF KIAGGMNIDN LPKYVMVALP 

FeCoV TNAILCVNAL -GRSYVLPLD GTPTGVTLTL LSGNLYAEGF KMAGGLTIEH LPKYVMIATP 

PRCOV TNAILCVSAL -GRSYVLPLE GVPTGVTLTL LSGNLYAEGF KIAGGMTIDN LPKYVMVALP 

OC43 TNNLMCIDMK -GTMYVRPII EDYHTLTVTI IRGHLYIQGI KLGTGYSLAD LPAYMTVAK- 

PHEV TNNLMCIDMK -GRMYVRPII EDYHTLTATI IRGHLYIQGI KLGTGYSLSD LPAYVTVAK- 

BoCoV TNNLMCIDMK -GRMYVRPII EDYHTLTVTI IRGHLYMQGI KLGTGYSLSD LPAYVTVAK- 

MHV TNNLMCIDMK -GTVYVRPII EDYHTLTATI IRGHLYMQGV KLGTGFSLSD LPAYVTVAK- 

RatSAV TNNLMCIDVK -GTVYVRPII EDYHTLTATN VRGHLYMQGV KLGTGFSLSD LPAYVTVAK- 

AIBV SNAVGSILLT NGQQCNFAIE SVPMVLSPII KNGVLYCEGQ WLAK-CEPDH LPKDIFVCTP 

SARS TNILLNVPLR -GTIVTRPLM ESELVIGAVI IRGHLRMAGH PLGR-CDIKD LPKBITVAT- 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 PCT/NL2004/000805 
^ 85/87 

...,|....| ....I.... I ....I. ...I I I 

245 255 265 275 285 

EMCR STTIVCDRVG RSVNETSQTG WAFYVRAKHG DFSGVASQEG VLSEREKLLH LI 

229B STTIIYSRVG RSVNSQNSTG WVFYVRVKHG DFSAVSSPMS NMTENERLLH FF 

PEDV TTTIVYGRVG RSVNASSGTG WAFYVRSKHG DYSAVSNPSA VLTDSEKVLH LV 

TGEV SRTIVYTLVG KKLKASSATG WAYYVKSKAG DYSTEAR-TD NLSEQEKLLH MV 

CaCoV VRTIVYTLVG KKLKASSATG WAYYVKSKAG DYSTDAR-TD NLSEHEKLLH MV 

FeCoV SRTIVYTLVG KQLKATTATG WAYYVKSKAG DYSTEAR-TD NLSEHEKLLH MV 

PRCOV SRTIVYTLVG KKLKASSATG WAYYVKSKAG DYSTEAR-TD NLSEQEKLLH MV 

OC43 VTHLCTYKRG FLDRISDTSG FAVYVKSKVG NYRLPSTQKG SGMDTALLRN NI 

PHEV VTHLCTYKRG FLDRIGDTSG FAVYVKSKVG NYRLPSTHKG SGMDTALLRN NI 

BoCoV VSHLLTYKRG FLDKIGDTSG FAVYVKSKVG NYRLPSTQKG SGMDTALLRN NI 

MHV VSHLCTYKRA FLDKVDGVSG FAVYVKSKVG NYRLPSN-KP . SGMDTALLR- -I 

RatSAV VSHLCTYKRA FLDKVDGVSG FAVYVKSKVG NYRLPSN-KP SGADTALLR I 

AIBV DRRNIYRMVQ KYTGDQSGNK KRFATFVYAK QSVDTGELES VATGGSSLYT — 

SARS SRTLSYYKLG ASQRVGTDSG FAAYNRYRIG NYKLNTDHAG SNDNIALLVQ — 



J . Putative Orf N (Nucleoprotein) 

— I — I — I. ...I — I — I — I — I — I — I — I — I 

5 15 25 35 45 55 

EMCR — MAS VN W ADDR AARKKF— 

22 9E MAT VK W ADASEPQ RGRQGR 

PEDV — MAS VS F QDRG — RKR 

TGEV — MANQGQR- VS W GDESTKT — RGRSNSRG RKNNN 

FeCoV — MATQGQR- VN W GDEPSKR — RGRSNSRG RKNND 

PRCoV MANQGQR- VS W GDESTKI RGRSNSRG RKINN 

CaCoV — MASQGQR VS W GDESTKR — RGRSNSRG RKNND 

RSDACoV MSFVPGQENA GSRSSSGNRA GNGILKKTTW ADQTERGQNN GNRGRRNQPK QTATTQ-PNT 

MHV MSFVPGQENA GSRSSSGNRA GNGILKKTTW ADQTERG NRGRRNHPK QTATTQ-PNA 

PHEV MSFTPGKQSS -SRASSGNRS GNGILK W ADQSDQSRNV QTRGRRVQSK QTATSQQPSG 

OC43 MSFTPGKQSS -SRASSGNRS GNGILK W ADQSDQFRNV QTRGRRAQPK QTATSQQPSG 

BoCoV MSFTPGKQSS -SRASFGNRS GNGILK W ADQSDQSRNV QTRGRRAQPK QTATSQLPSG 

SARS MSDNGPQS NQRSAPRITF GGPTDSTDNN QNGGRNGARP KQRRPQ 

AIBV MASG K A AGKTDAPAPV IKLGGPKPPK VGSS 

I I I I I I I I I I I I 

65 75 85 95 105 115 

EMCR PPPSFY MPLLVSSDKA PYRVIPRNLV PIGKGNK-DE QIGYWNVQER — WRMRRGQR 

229E IPYSLY SPLLVDSE-Q PWKVIPRNLV PINKKDK-NK LIGYWNVQKR — FRTRKGKR 

PEDV VPLSLY APLRVTNDKP LSKVLANNAV PTNKGNK-DQ QIGYWNEQIR — WRMRRGER 

TGEV IPLSFF NPITLQQGSK FWNLCPRDFV PKGIGNR-DQ QIGYWNRQTR — YRMVKGQR 

FeCoV IPLSFY NPITLEQGSK FWNLCPRDLV PKGIGNK-DQ QIGYWNRQIR — YRIVKGQR 

PRCoV IPLSFF NPITLQQGAK FWNSCPRDFV PKGIGNR-DQ QIGYWNRQTR — YRMVKGQR 

CaCoV IPLSFF NPITLEQGSK FWDLCPRDFV PKGIGNK-DQ QIGYWNRQTR — YRMVKGRR 

RSDACoV GSVVPHYSWF SGITQFQKGK EFQFAGGQGV PIANGIPPSE QKGYWYRHNR RSFKTPDGQQ 

MHV GSVVPHYSWF SGITQFQKGK EFQFAQGQGV PIASGIPASE QKGYWYRHNR RSFKTPDGQH 

PHEV GTWPYYSWF SGITQFQKGK EFEFAEGQGV PIAPGVPSTE AKGYWYRHNR RSFKTADGNQ 

OC43 GNWPYYSWF SGITQFQKGK EFEFVEGQGV PIAPGVPATE AKGYWYRHNR RSFKTADGNQ 

BoCoV GNWPYYSWF SGITQFQKGK EFEFAEGQGV PIAPGVPATE AKGYWYRHNR RSFKTADGNQ 

SARS GLPNNTASWF TALTQHGK-E ELRFPRGQGV PINTNSGPDD QIGYYRRATR R-VRGGDGKM 

AIBV GNASWF QAIKAKKLNT PPPKFEGSGV PDNENIKPSQ QHGYWRRQAR — FKPGKGGR 

I I I 1 I I I 1 \ I I I 

125 135 145 155 165 175 

EMCR VDLPPKVHFY YLGTGPHKDL KFRQRSDGVV WVAKEGAKTV NTSLGNRK— RNQKPLEPKF 

229E VDLSPKLHFY YLGTGPHKDA KFRERVEGW WVAVDGAKTE PTGYGVRR— KNSEPEIPHF 

PEDV lEQPSNWHFY YLGTGPHGDL RYRTRTEGVF WVAKEGAKTE PTNLGVRK — ASEKPIIPKF 

TGEV KELPERWFFY YLGTGPHADA KFKDKLDGVV WVAKDGAMNK PTTLGSRG — ANNESKALKF 

FeCoV KELAERWFFY FLGTGPHADA KFKDKIDGVF WVARDGAMNK PTTLGTRG — TNNESKPLRF 

PRCoV KELPERWFFY YLGTGPHADA KFKDKLDGVV WVAKDGAMNK PTTLGSRG — ANNESKALKF 

CaCoV KNLPEKWFFY YLGTGPHADA KFKQKLDGW WVARGDSMTK PTTLGTRG — TNNESKALKF 

RSDACoV KQLLPRWYFY YLGTGPHAGA SFGDSIEGVF WVANSQADTN TSADIVERDP SSHEAIPTRF 

MHV KQLLPRWYFY YLGTGPHAGA BYGDDIEGVV WVASQQADTK TTADVVERDP SSHEAIPTRF 

PHEV RQLLPRWYFY YLGTGPHAKD QYGTDIDGVF WVASNQADIN TPADIVDRDP SSDEAIPTRF 

OC43 RQLLPRWYFY YLGTGPHAKD QYGTDIDGVY WVASNQADVN TPADIVDRDP SSDEAIPTRF 

BoCoV RQLLPRWYFY YLGTGPHAKD QYGTDIDGVF WVASNQADVN TPADILDRDP SSDEAIPTRF 

SARS KELSPRWYFY YLGTGPEASL PYGANKEGIV WVATEGALNT PKDHIGTRNP NNNAATVLQL 

AIBV KPVPDAWYFY YTGTGPAADL NWGDTQDGIV WVAAKGADTK SRSNQGTRDP DKFDQYPLRF 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/049814 



PCT/NL2004/000805 



86/87 



EMCR 

229E 

PEDV 

TGEV 

FeCoV 

PRCoV 

CaCoV 

RSDACoV 

MHV 

PHEV 

OC43 

BoCoV 

SARS 

AIBV 



I I 

185 
SIALPPELSV 
NQKLPNGVTV 
SQQLPSVVEI 
DGKVPGEFQL 
DGKIPPQFQL 
DGKVPGEFQL 
DVKVPSEFHL 
APGTVLPQGF 
APGTVLPQGF 
PPGTVLPQGY 
PPGTVLPQGY 
PPGTVLPQGY 
PQGTTLPKGF 
SDGGPDGNFR 



I I 

195 
VEFEDRSNNS 

VEEPD 

VEPNTPP — A 

EVNQS 

EVNRS 

EVNQS 

EVNQL 

YVEGS 

YVEGS 

YIEGS 

YIEGS 

YIEGS 

YAEGS 

WDFIP 



215 



..I I 

225 235 

NNSR DS 

SQSR 



I I 

205 
SRASSRSSTR 
SRAPSRSQSR 

SRAMSRSRSR GNGNNRSRSP SNNRGNNQSR GNSQNRGNNQ 

-RDNSRSRSQ 

-RNNSRSGSQ 

-RDNSRSRSQ 

-RDNSRSRSQ 

GRSAPASRSG 

GRSAPASRSG 

GRSAPNSRST 

GRSAPNSRST 

GRSAPNSRST 

-RGGSQASSR 

— LN-RGRSG 



EMCR 

229E 

PEDV 

TGEV 

FeCoV 

PRCoV 

CaCoV 

RSDACoV 

MHV 

PHEV 

OC43 

BoCoV 

SARS 

AIBV 



I I I I 

245 255 

SRSTSRQ QSR- 

GRGESKP QSRN 

GRGASQNRGG NNNNNNKSRN 

SRSRSRNR SQSRG 

SRSVSRNR SQSRG 

SRSRSRNR SQSRG 

SRSQSRNR SQSRG 

SRSQSRGP — NNRA 

SRSQSRGP — NNRA 

SRAPNRAPS- AGSRS 

SRTSSRASS AGSRS 

SRASSRASS AGSRS 

SSSRSRGN SRNST 

-RSTAASS AAASRA 



I I 

265 
TRSDSNQS — 
PSSDRNHN — 



..I I 

275 

S-SDL 

SQDDI 

QSNNRNQSND RGGVTSRDDL 

RQQFNNKK DDSV 

RHHSNNQ NNNV 

RQQSNNKK DDSV 

RQLSNNKK — DDNV 

RSSSNQRQ PASTV 

RSSSNQRQ — PASAV 

RANSGNRT STPGV 

RANSGNRT PTSGV 

RANSGNRT PTSGV 

PGSSRGNS PARMA 

PSREGSRG RRSDS 



I I 

285 
VAAVTLALKN 
MKAVAAALKS 
VAAVRDALKS 
EQAVLAALKK 
EDTIVAVLEK 
EQAVLAALKK 
EQAVLAALKK 
KPDMAEEIAA 
KPDMAEEIAA 
TPDMADQIAS 
TPOMADQIAS 
TPDMADQIAS 
SGGGETALAL 
GDDLIARAAK 



I I 

295 
LGFDN — QSK 
LGFDKP-QEK 
LGIGEN-PDR 
LGVDTE-KQQ 
LGV-TD-KQ- 
LGVYTE-KQQ 
LGVDTE-KQQ 

LVLAN LG 

LVLAK LG 

LVLAK LG 

LVLAK LG 

LVLAK LG 

LLLORLNQLE 
IIQDQ 



EMCR 

229E 

PEDV 

TGEV 

FeCoV 

PRCOV 

CaCov 

RSDACoV 

MHV 

PHEV 

OC43 

BoCoV 

SARS 

AIBV 



EMCR 

229E 

PEDV 

TGEV 

FeCoV 

PRCoV 

CaCoV 

RSDACOV 

MHV 

PHEV 

OC43 

BoCoV 

SARS 

AIBV 



EMCR 

229E 

PEDV 

TGEV 

FeCoV 

PRCOV 

CaCoV 

RSDACoV 

MHV 

PHEV 

OC43 

BoCoV 

SARS 

AIBV 



1 I 

305 
SPSSSGTSTP 
DKKSAKTGTP 
HKQQQKPKQE 
QRSRSKSKER 
-RSRSKPRER 
QRSRSKSKER 
-RSRSKSKER 
-KDAGQPKQV 
-KDAGQPKQV 
-KDATKPQQV 
-KDATKPQQV 
-KDATKPQQV 
SKVSGKGQQQ 
QKKGSRI 





I I I 1 

315 325 

K K PNKPLSQ 

KPSRNQSPAS SQTSAKSLAR 

K-SDN SG KNTPKKNKSR 

S NSKTR 

S DSKPR 

S NSKTR 

S SSKTR 

T KQSAK 

T KQSAK 

T KQTAK 

T KHTAK 

T KQTAK 

Q GQTVTK 

T KAKAD 



I I 

335 

PRADKPS 

SQSSETKEQK 

ATSKERD 

DTTPBCNE 

DTTPKNA 

DTTPKNE 

DTTPKNE 

EVRQKIL 

EVRQKIL 

EVRQKIL 

EVRQKIL 

EIRQKIL 

KSAAEAS 

EMAHRRY 



I I 

345 
-QLKKPRWKR 
HEMQKPRWKR 
-LKDIPEWRR 

NKHTWKR 

NKHTWKK 

NKHTWKR 

NKHTWKR 

NKPRQKR 

TKPRQKR 

NKPRQKR 

NKPRQKR 

NKPRQKR 

KKPRQKR 

CKRT 



I I 

355 
VPTR — EENV 
QPNDDVTSNV 
IPKG— ENSV 

TAGK GDV 

TAGK GDV 

TAGK GDV 

TAGK GDV 

TPNK — QCPV 
TPNK — QCPV 
SPNK — QCTV 
SPNK — QCTV 
SPNK — QCTV 
TATK— QYNV 
IPPN YRV 



365 
IQCFGPRDFN 
TQCFGPRDLD 
AACFGPRGGF 
TRFYGARSSS 
TTFYGARSSS 
TRFYGARSSS 
TKFYGARSSS 
QQCFGKRGPN 
QQCFGKRGPN 
QQCFGKRGPN 
QQCFGKRGPN 
QQCFGKRGPN 
TQAFGRRGPE 
DQVFGPRTKG 

I I 



1 I 

375 

H NMGDSD 

H NFGSAG 

K NFGDAE 

A NFGDTD 

A NFGDSD 

A NFGDSD 

A NFGDSD 

Q NFGGPE 

Q NFGGSE 

Q NFGGGE 

Q NFGGGE 

Q NFGGGE 

QTQGNFGDQD 
K-EGNFGDDK 



I I 

385 
LVQNGVDAKG 
WANGVKAKG 
FVEKGVDASG 
LVANGSSAKH 
LVANGNAAKC 
LVANGSSAKH 
LVANGNGAKH 
MLKLGTSDPQ 
MLKLGTSDPQ 
MLKLGTSDPQ 
MLKLGTSDPQ 
MLKLGTSDPQ 
LIRQGTDYKH 
MNEEGIKDGR 



I 



425 



DNV 

NTV 

DSY 

DQI 

DQV 

DQI 

DQI 

GVDBPTKDVY 
GADEPTKDVY 
NPDEPQKDVY 
NPDEPQKDVY 
NLDEPQKDVY 

SGT 

DGL 



I I 

435 

QITYT Y 

VLTFT T 

EITYN Y 

EVTFT H 

KVTLT H 

EVTFT H 

EVTFT H 

ELQYSGAVRF 
ELQYSGAIRF 
ELRYNGAIRF 
ELRYNGAIRF 
ELRYNGAIRF 
WLTYHGAIKL 
HLRFEFTTW 



\ I 

445 
KMLVAKDNKN 
RVTVPKDHPH 
KMTVPKSDPN 
KYHLPKDDPK 
TYYLPKDDAK 
KYHLPKDHPK 
KYHLPKDDPK 
DSTLPGFETI 
DSTLPGFETI 
DSTLSGFETI 
DSTLSGFETI 
DSTLSGFETI 
DDKDPQFKDN 
PCDDPQFDNY 



395 
FPQLAELIPN 
YPQFAELVPS 
YAQIASLAPN 
YPQLAECVPS 
YPQIAECVPS 
YPQLAECVPS 
YPQLAECVPS 
FPILAELAPT 
FPILAELAPT 
FPILAELAPT 
FPILAELAPT 
FPILAELAPT 
WPQIAQFAPS 
VTAMLNLVPS 



I I 

405 
QAALFFDSEV 
TAAMLFDSHI 
VAALLFGGNV 
VSSILFGSYW 
VSSIIFGSQW 
VSSILFGSYW 
VSSILFGSHW 
PGAFFFGSKL 
PSAFFFGSKL 
AGAFFFGSRL 
AGAFFFGSRL 
AGAFFFGSRL 
ASAFFGMSRI 
SHACLFGSRV 



I I 

415 

STDEVG 

VSKESG 

AVRELA 

TSKEDG 

SAEEAG 

TSKEDG 

TAKEDG 

ELVKKN — SG 
ELVKKN — SG 
ELAKVQNLSG 
ELAKVQNLSG 
ELAKVQNLSG 

GMEVTP 

TPKLQL 



I I 

455 
LPKFIEQISA 
LGKFLEELNA 
VELLVSQVDA 
TGQFLQQINA 
TSQFLEQIDA 
TEQFLQQINA 
TGQFLQQINA 
MKVLNENLNA 
MKVLTENLNA 
MKVLNQNLNA 
MKVLNENLNA 
MKVLNENLNA 
VILLNKHIDA 
VKICDQCVDG 



I I 

465 

FTKPS 

FTR 

FKTGN 

YARPS 

YKRPS 

YASPS 

YARPS 

YQNQA 

YQDQA 

YQHQE 

YQQQ 

YQQQ 

YKTFP 

VGTRPKDDEP 



....I.. ..I 

475 
SIKEMQSQSS 
EMQQHPLLNP 
AKLQRKKEKK 
EVAKEQRKRK 
EVAKDQRQRR 
ELAKEQRKRK 
EVAKEQRQRK 
GGADVVSPKP 
GSVDLVSPKP 
DGMMNISPKP 
DC»4MNMSPKP 
DGMMNMSPKP 
PTEPKKDKKK 
KPKSRSSSRP 
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EMCR 

229E 

PEDV 

TGEV 

FeCoV 

PRCoV 

CaCoV 

RSDACoV 

MHV 

PHEV 

OC43 

BoCoV 

SARS 

AIBV 



I I 

485 
HVAQNTVLN- 
SALEFNPSQ- 
NKRETTLQQH 
SRSKSAERS- 
SRSKSADK— 
SRSKSAERS- 
ARSKSVERV- 
QRKRGTKQT- 
PRRGRRQAQ- 
QRQRGQKN — 
QRQRGHKN — 
QRQRGQKN— 
KTDEAQPLP- 
ATRGNSPAPR 



..I.. 
495 



EEAIYDDVGA 



-AQKEELDSI 
-EKKDEVDNV 
--GQVENDNV 
— GQGENDNI 
--GQGENDNI 

QQRPKKEKKL 



505 

-ASIPES 

-TSPATA 

PSDVTHANLE 
EQDWPDALI 
KPEELSVTLV 
EQEVVPDSLI 
EQEWPDALT 
SVAKPKSAVQ 
SVAKPKSLVQ 
SVAAPKSRVQ 
SVAVPKSRVQ 
SVAAPKSRVQ 
QRQKKQPTVT 
KKQDDEAOKA 



I I 

515 
-KPLADDDSA 
-EPVRDEVSI 
WDTAVDGGDT 
ENYTDVFDDT 
EAYTDVFDDT 
ENYTDVFDDT 
ENYTDVFDDT 
RNVSRELTPE 
RNVSRELTPE 
QNKSRELTAE 
QNKSRELTAE 
QNKSRELTAE 
LLPAADMDDF 
LTSDEERNNA 



I I 

525 
IIEIVNEVLH 
ETDIIDEVN- 
AVEIINEIFD 
QVEIIDEVTN 
QVEMIDEVTN 
QVEMIDEVTN 
QVEIIDEVTN 
DRSLLAQILD 
DRSLLAQILD 
DISLLKKMDE 
DISLLKKMDE 
DISLLKKMDE 
SRQLQNSMSG 
QLEFYDEPKV 



..I .. 

535 



TGN- 



DGVVPDGLDD 
DGVVPDGLED 

P YTED 

p YTED 

p YTED 

ASADSTQA — 
INWGDAALGE 



EMCR 

229E 

PEDV 

TGEV 

FeCov 

PRCoV 

CaCoV 

RSDACoV 

MHV 

PHEV 

OC43 

BoCoV 

SARS 

AIBV 



-SNV 
DSNV 
TSEI 
TSEI 
TSEI 

NEL- 



k. S'untranslated region (genomic sequence) 



EMCR5 • UTR 
229E5'0TR 



EMCR5 • OTR 
229E5*UTR 



EMCR5 • UTR 
229E5'UTR 



EMCRS'UTR 
229E5'UTR 



EMCR5 ' UTR 
229E5'UTR 



I I I I ! 1 ....I I 1 I I I 

5 15 25 35 45 55 

AGATAGA GAATTTTCTT ATTTAGACTT TGTGTCTACT 

ACTTAAGTAC CTTATCTATC TACAGATAGA AAAGTTGCTT -TTTAGACTT TGTGTCTACT 

I I I 1 I I I I 1 I I 1 

65 75 85 95 105 115 

CCTCTCAACT AAACGAAATT TTT-CTAGTG CTGTCATTTG TTATG — GCA GTCCTAGTGT 

TTTCTCAACT AAACGAAATT TTTGCTATGG CCGGCATCTT TGATGCTGGA GTCGTAGTGT 



I 



I I ( I I I I I I 

125 135 145 155 165 175 

AATTGAAATT TCGTCAAGTT TGTAA-ACTG GTTAGGCAAG TGTTGTATTT TCTGTGTTTA 
AATTGAAATT TCATTTGGGT TGCAACAGTT TGGAAGCAAG TGCTGTGTGT CCTA-GTCTA 



I 



..I I I I I I I I I I 

185 195 205 215 225 235 

AGCACTGGTG GTTCTGTC-C ACTAGTGCAC AC-ATTGATA CTTAAGT-GG TGTTCTGTCA 
AGGGTTTCGT GTTCCGTCAC GAGATTCCAT TCTACAAACG CCTTACTCGA GGTTCCGTCT 



I 



.1 



I 



I 



.1 I I. ...I ....i.. 

245 255 265 275 285 

CTGCTTATTG TGGAAGCAAC GTTCTGTCGT TGTGGAAACC AATAACTGCT AACC 
CGTGTTTGTG TGGAAGCAAA GTTCTGTCTT TGTGGAAACC AGTAACTGTT CCTA 
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